Abstract

The Azema-Yor solution (respectively the Perkins solution) of the Skorokhod embedding
problem has the property that it maximises (respectively minimises) the law of the maximum
of the stopped process. We show that these constructions have a wider property in that they
also maximise (and minimise) expected values for a more general class of bivariate functions
$F(W_\tau, S_\tau)$ depending on the joint law of the stopped process and the maximum. Moreover,
for monotonic functions $g$, they also maximise and minimise $\mathbb{E}[\int_0^\tau g(S_t)\,dt]$ amongst
embeddings of $\mu$, although, perhaps surprisingly, we show that for increasing $g$ the Azema-Yor
embedding minimises this quantity, and the Perkins embedding maximises it.

For $g(s) = s^{-2}$ we show how these results are useful in calculating model independent
bounds on the prices of variance swaps.

Along the way we also consider whether weak convergence of $\mu_n$ to $\mu$ is a sufficient condition
for the associated Azema-Yor and Perkins stopping times to converge. In the case of the
Azema-Yor embedding, if the potentials at zero also converge, then the stopping times
converge almost surely, but for the Perkins embedding this need not be the case. However,
under a further condition on the convergence of atoms at zero, the Perkins stopping times
converge in probability (and hence converge almost surely down a subsequence).

1 Introduction

Let $W = (W_t)_{t \ge 0}$ be Brownian motion, null at 0, and $\mu$ a centred probability measure. Then the
Skorokhod embedding problem (SEP) (Skorokhod [20]) is to find a stopping time $\tau$ such that
the stopped process satisfies $W_\tau \sim \mu$. There are many classical solutions to this problem (for
a survey, see Obloj [E]), and further solutions continue to appear in the literature, including
most recently Hirsch et al [S]. Further impetus to the investigation of old and new solutions is
derived from the connections between solutions of the SEP and model independent bounds for
the prices of options (for a survey, see Hobson [11]).

Given the multiplicity of solutions to the SEP, it is natural to search for embeddings with
additional optimality properties. In particular, given a functional of the stopped Brownian
path $(W_t)_{0 \le t \le \tau}$, these constructions aim to maximise its expected value over (a suitable
subclass of) embeddings of $\mu$. For example, if $F$ is an increasing function, and
$S_t = \sup_{s \le t} W_s$, then the Azema-Yor solution [2] maximises $\mathbb{E}[F(S_\tau)]$ over uniformly
integrable embeddings, and the Perkins embedding [16] minimises the same quantity.

Our goal in this paper is to extend this result to functions $F = F(W_\tau, S_\tau)$. Then, subject
to regularity conditions, our first result (Theorem 5.3) is that

Suppose $F_s(w, s)/(s - w)$ is monotonic decreasing in $w$. Then $\mathbb{E}[F(W_\tau, S_\tau)]$ is
minimised (respectively maximised) over uniformly integrable embeddings $\tau$ of $\mu$ by the
Azema-Yor (respectively Perkins) embedding.



This result is a tool in the derivation of our second result, Theorem 7.1, which, again subject
to regularity conditions, is as follows:

Suppose $g$ is increasing. Then $\mathbb{E}[\int_0^\tau g(S_u)\,du]$ is minimised (respectively maximised)
over uniformly integrable embeddings $\tau$ of $\mu$ by the Azema-Yor (respectively Perkins)
embedding.

One approach to finding extremal values of $\mathbb{E}[F(W_\tau, S_\tau)]$ is to utilise the work of Kertz and
Rösler [13], Vallois [22] and Rogers [19] who characterise the possible joint laws of $(W_\tau, S_\tau)$.
These characterisations take the form of constraints on the possible laws of $(W_\tau, S_\tau)$, but that
still leaves our problem as a constrained optimisation problem. In fact, there are parallels
between Equation 3.2 of Theorem 3.1 of Rogers [19], and some of the quantities that arise in
our study, see Remark 5.7, but we shall not make direct use of this connection.

At first sight the second result above may appear counterintuitive. After all, for increasing $g$
the Azema-Yor embedding maximises the law of $g(S_\tau)$ so one might also expect it to maximise
the law of $\int_0^\tau g(S_u)\,du$. However, the exact opposite is true, and the Azema-Yor embedding
minimises the expected value of this quantity. We return to this issue in Remark 7.2, where we
explain this phenomenon.

One of our tools for solving the above problems is to solve the problem in the case where $\mu$
has bounded support, and to approach the case of a general measure by approximation. In order
to carry out this programme we need to analyse when and whether convergence of probability
measures is sufficient to guarantee that the associated Azema-Yor and Perkins embeddings
converge. This proves to be a delicate question. Under the additional (and necessary) hypothesis
that $\int_{\mathbb{R}} |x|\,\mu_n(dx) \to \int_{\mathbb{R}} |x|\,\mu(dx)$, the Azema-Yor embedding of $\mu_n$ does converge almost
surely to the Azema-Yor embedding of $\mu$. However, this need not be the case for the Perkins
embedding, and the sequence of Perkins embeddings of $\mu_n$ may fail to converge on an almost
sure basis.



2 A variance swap on squared returns 

The original motivation for our study came from financial mathematics and the pricing of 
variance swaps, and one of the contributions of this article is to establish a link between variance 
swap bounds and Skorokhod embedding theory. The implications of this connection are the 
subject of related work (Hobson and Klimmek [12]). Informed by the results presented here, but
necessarily using different methods, the article [12] shows how to construct model-independent
bounds and hedging strategies for a general family of variance swaps. In this section we outline 
the link between variance swaps and the second result from the introduction. 

Let $X = (X_t)_{0 \le t \le T}$ represent the discounted price of a financial asset. Under the assumption
of no-arbitrage, there exists a measure under which $X$ is a (local) martingale. Suppose there
exists a (possibly discontinuous) time-change $t \mapsto A_t$, null at 0, such that $X_t = B_{A_t}$ for some
Brownian motion $B$. (If $X$ is continuous then the existence of such a time-change is guaranteed
by the Dambis-Dubins-Schwarz Theorem.) Since $X$ is a non-negative price process we suppose
it has starting value $X_0 = B_0 = x_0 > 0$.

We suppose that we have a filtered probability space $(\Omega, \mathcal{F}, \mathbb{F}, \mathbb{P})$ such that $B$ is an $\mathbb{F}$-Brownian
motion. Then $X$ is adapted to the filtration $\mathbb{G}$ where $\mathcal{G}_t = \mathcal{F}_{A_t}$.

Now suppose that we know the prices of put and call options with maturity $T$. Knowledge
of put and call option prices with expiry time $T$ is equivalent to knowledge of the marginal law
of the process at time $T$ (see Breeden and Litzenberger [3]). Suppose that $X_T \sim \mu$ and that $\mu$ is
centred at $x_0$, and has support in $\mathbb{R}_+$. We will determine bounds for the fair value of a variance
swap given the terminal law $\mu$. Note that if $X_T \sim \mu$ then $A_T$ is a solution of the Skorokhod
embedding problem for $\mu$ in $B$.

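Following Demeterfi et al. [7] we define the pay-out $V_T = V((X_s)_{0 \le s \le T})$ of an idealised
variance swap as

$$V_T = \int_0^T \frac{d[X,X]_t}{X_{t-}^2} = \int_0^T \left(\frac{dX^c_t}{X_{t-}}\right)^2 + \sum_{0 \le t \le T} \left(\frac{\Delta X_t}{X_{t-}}\right)^2, \tag{2.1}$$

where $\Delta X_t = X_t - X_{t-}$, and $X^c$ is the continuous part of $X$.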

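Let $A^c$ be the continuous part of $A$. Note that $dA^c_t = (dX^c_t)^2 = d[X, X]^c_t$. Let $S^X = (S^X_t)_{t \ge 0}$
(respectively $S^B$) be the process of the running maximum of $X$ (respectively $B$), and let $I^X$
(respectively $I^B$) denote the corresponding infimum. Then we have $X_t \le S^X_t \le S^B_{A_t}$ and it
follows that path-by-path, with $\Delta B_{A_t} = B_{A_t} - B_{A_{t-}}$,

$$V_T \ge \int_0^T \frac{d[X,X]^c_t}{(S^X_t)^2} + \sum_{0 \le t \le T} \left(\frac{\Delta X_t}{S^X_{t-}}\right)^2 \ge \int_0^T \frac{dA^c_t}{(S^B_{A_t})^2} + \sum_{0 \le t \le T} \left(\frac{\Delta B_{A_t}}{S^B_{A_{t-}}}\right)^2. \tag{2.2}$$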



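We suppose that $X$ has a second moment. Then $(X_t)_{0 \le t \le T}$ is a square-integrable martingale
and we find that

$$\mathbb{E}\left[\int_0^T \frac{dA^c_t}{(S^B_{A_t})^2} + \sum_{0 \le t \le T} \left(\frac{\Delta B_{A_t}}{S^B_{A_{t-}}}\right)^2\right] = \mathbb{E}\left[\int_0^T \frac{dA^c_t + \Delta A_t}{(S^B_{A_{t-}})^2}\right] \ge \mathbb{E}\left[\int_0^{A_T} \frac{du}{(S^B_u)^2}\right]. \tag{2.3}$$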



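We say that $\tau$ is an embedding of $\mu$ if $\tau$ is a stopping time for which $B_\tau$ has law $\mu$ (we write
$B_\tau \sim \mu$ or $\mu = \mathcal{L}(B_\tau)$). Let $\mathcal{S} = \mathcal{S}(B, \mu)$ be the set of stopping times which embed $\mu$, and
let $\mathcal{S}_{UI} = \mathcal{S}_{UI}(B, \mu)$ be the subset of $\mathcal{S}(B, \mu)$ for which $(B_{t \wedge \tau})_{t \ge 0}$ is uniformly integrable. The
inequalities above imply that the fair value of $V_T$ is bounded below by

$$\inf_{\tau \in \mathcal{S}_{UI}(B, \mu)} \mathbb{E}\left[\int_0^\tau \frac{du}{(S^B_u)^2}\right]. \tag{2.4}$$

Similarly, using the inequality $I^B_{A_t} \le I^X_t \le X_t$ we find that the fair value of $V_T$ is bounded above
by

$$\sup_{\tau \in \mathcal{S}_{UI}(B, \mu)} \mathbb{E}\left[\int_0^\tau \frac{du}{(I^B_u)^2}\right]. \tag{2.5}$$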

This problem can be converted into a problem concerning the maximum $S^B$ by a reflection
argument, see Section 8.1.

Now let $G(b, s) = (s - b)^2/s^2$; then by Ito's Lemma,

$$G(B_\tau, S^B_\tau) = G(x_0, x_0) + \int_0^\tau \frac{du}{(S^B_u)^2} - \int_0^\tau \frac{2(S^B_u - B_u)}{(S^B_u)^2}\,dB_u.$$

It follows that if $\int_0^{t \wedge \tau} 2(S^B_u - B_u)(S^B_u)^{-2}\,dB_u$ is a uniformly integrable martingale then

$$\mathbb{E}\left[\int_0^\tau \frac{du}{(S^B_u)^2}\right] = \mathbb{E}\left[\frac{(S^B_\tau - B_\tau)^2}{(S^B_\tau)^2}\right]$$

and the question of bounding the fair value of $V_T$ is transformed into a question of maximising
or minimising expressions of the form $\mathbb{E}[F(B_\tau, S_\tau)]$ over embeddings of $\mu$. We return to the
calculation of the variance swap bound in Section 8.1.
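The Ito step above relies on two properties of the integrand: half the second derivative in $b$ equals $1/s^2$, and the derivative in $s$ vanishes on the diagonal $b = s$, so the $dS$ term drops out. The following is a minimal numerical sketch of that check, assuming $G(b,s) = (s-b)^2/s^2$ (the choice consistent with the identity displayed above); the helper names are hypothetical and the finite-difference step sizes are arbitrary.

```python
def G(b, s):
    """Assumed form G(b, s) = (s - b)^2 / s^2."""
    return (s - b) ** 2 / s ** 2


def second_diff_b(b, s, h=1e-4):
    """Central finite-difference approximation of d^2 G / db^2."""
    return (G(b + h, s) - 2.0 * G(b, s) + G(b - h, s)) / h ** 2


def diff_s_on_diagonal(s, h=1e-6):
    """One-sided finite-difference approximation of dG/ds at b = s.

    The running maximum only increases when B is at its maximum,
    so this is the only point where the s-derivative matters.
    """
    return (G(s, s + h) - G(s, s)) / h


if __name__ == "__main__":
    for b, s in [(1.0, 2.0), (0.5, 1.5), (2.9, 3.0)]:
        print(b, s, 0.5 * second_diff_b(b, s), 1.0 / s ** 2)
    print(diff_s_on_diagonal(2.0))
```

Since $G(x_0, x_0) = 0$ under this choice, taking expectations leaves only the $du$ integral and the terminal value.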



3 Preliminaries 

Let $(\Omega, \mathcal{F}, \mathbb{F}, \mathbb{P})$ be a filtered probability space satisfying the usual conditions and supporting a
Brownian motion $W = (W_t)_{t \ge 0}$ with $W_0 = 0$, and sufficiently rich that $\mathcal{F}_0$ contains a further
uniform random variable which is independent of $W$. Let $\mu$ be a centred probability measure.
To exclude trivialities we assume that $\mu$ is not $\delta_0$, the unit mass at 0. We say that $\tau$ is an
embedding of $\mu$ if $\tau$ is a stopping time for which $W_\tau$ has law $\mu$ (we write $W_\tau \sim \mu$ or $\mu = \mathcal{L}(W_\tau)$)
and we say that $\tau$ is uniformly integrable if the family $(W_{t \wedge \tau})_{t \ge 0}$ is uniformly integrable.

Let $\mathcal{S} = \mathcal{S}(W, \mu)$ be the set of stopping times which embed $\mu$, and let $\mathcal{S}_{UI} = \mathcal{S}_{UI}(W, \mu)$ be
the subset of $\mathcal{S}(W, \mu)$ consisting of uniformly integrable stopping times. For $\mathcal{S}_{UI}(W, \mu)$ to be
non-empty we must have that $\mu$ is centred (i.e. $\int_{\mathbb{R}} |x|\,\mu(dx) < \infty$ and $\int_{\mathbb{R}} x\,\mu(dx) = 0$). In this
context (Brownian motion and centred target laws) a result of Monroe [14] gives that a stopping
time is uniformly integrable if and only if it is minimal (in the sense that if $\tau$ is minimal and
$\sigma \le \tau$ with $W_\sigma \sim W_\tau$, then $\sigma = \tau$ almost surely). The class of minimal stopping times is a
natural class of 'good' (in the sense of small) stopping times.

For the Brownian motion $W$, started at 0, we write $H_x$ for the first hitting time of $x$, and
for a set $A$, $H_A = \inf\{u \ge 0 : W_u \in A\}$.

For a process $(Y_t)_{t \ge 0}$ and a stopping time $\sigma$ we write $Y^\sigma = (Y^\sigma_t)_{t \ge 0}$ for the stopped process.

Given a centred probability measure $\mu$, let $X_\mu$ be a random variable with law $\mu$ and define
$C(x) = C_\mu(x) = \mathbb{E}[(X_\mu - x)^+]$ and $P(x) = P_\mu(x) = \mathbb{E}[(x - X_\mu)^+]$. Then $C$ and $P$ are monotonic
convex functions with $C(0) = P(0)$. Then $U(x) = U_\mu(x) = \mathbb{E}|X_\mu - x| = C(x) + P(x)$ is (minus)
the potential associated with $\mu$. Conversely any convex function $U$ with $\lim_{x \to \pm\infty}(U(x) - |x|) = 0$
is the potential of some centred probability measure $\mu$ (Chacon [1]).
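As an illustration of these definitions, the sketch below evaluates $C$, $P$ and $U$ exactly for a small discrete law and checks $C(0) = P(0)$, $U = C + P$ and $U(x) = |x|$ outside the support. The three-atom law `MU` and the function names are hypothetical choices made only for this example.

```python
from fractions import Fraction

# Hypothetical centred discrete law, stored as {atom: probability}.
MU = {
    Fraction(-2): Fraction(1, 6),
    Fraction(0): Fraction(1, 2),
    Fraction(1): Fraction(1, 3),
}


def call_fn(law, x):
    """C(x) = E[(X - x)^+]."""
    return sum(p * max(v - x, 0) for v, p in law.items())


def put_fn(law, x):
    """P(x) = E[(x - X)^+]."""
    return sum(p * max(x - v, 0) for v, p in law.items())


def potential(law, x):
    """U(x) = E|X - x|, the (minus) potential of the law."""
    return sum(p * abs(v - x) for v, p in law.items())


if __name__ == "__main__":
    print("mean:", sum(v * p for v, p in MU.items()))
    print("C(0), P(0):", call_fn(MU, 0), put_fn(MU, 0))
    print("U(5), U(-5):", potential(MU, 5), potential(MU, -5))
```

Exact rational arithmetic is used so that the identities hold with equality rather than up to rounding.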

If $\mu$ has an atom at zero then we write $\mu^*$ for the measure obtained by omitting the atom
at 0, and then rescaling to get a probability measure. Thus $\mu^*(A) = \mu(A \setminus \{0\})/(1 - \mu(\{0\}))$.
Finally, we write $\bar{x} = \bar{x}_\mu$ for the upper limit on the support of $\mu$ (so $\bar{x}_\mu = \sup\{x : C_\mu(x) > 0\}$)
and $\underline{x} = \underline{x}_\mu$ for the corresponding lower limit $\underline{x}_\mu = \inf\{x : P_\mu(x) > 0\}$.



3.1 The Azema-Yor solution 

For $x \ge 0$, up to the upper limit on the support of $\mu$, define $\beta = \beta_\mu$ by

$$\beta(x) = \operatorname{argmin}_{y < x} \frac{C_\mu(y)}{x - y}. \tag{3.1}$$

Then $\beta$ is an increasing function with $\beta(x) < x$. Where the argmin is not uniquely defined
it is not important which value we choose. However, we fix one by insisting that $\beta$ is right
continuous, or equivalently by choosing the largest value for which the minimum is attained.
Observe that at $x = 0$, $\beta$ takes the value of the infimum of the support of $\mu$. For $x$ equal to, or
to the right of, the upper limit on the support of $\mu$ we set $\beta(x) = x$.

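For an increasing function $\beta : \mathbb{R}_+ \to \mathbb{R}$ with $\beta(x) \le x$ let $\tau_\beta$ be given by

$$\tau_\beta = \inf\{u : W_u \le \beta(S_u)\}. \tag{3.2}$$

Then $\tau^{AY} = \tau^{AY}_\mu$, the Azema-Yor stopping time for $\mu$, is given by $\tau^{AY}_\mu = \tau_{\beta_\mu}$. Thus we have
$\tau_{\beta_\mu} \in \mathcal{S}_{UI}(W, \mu)$, and moreover, for $F$ increasing, $\tau_{\beta_\mu}$ maximises $\mathbb{E}[F(S_\tau)]$ over $\tau \in \mathcal{S}_{UI}(W, \mu)$
(Azema-Yor [2, 1], Rogers [18]).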

Note that $\tau_{\beta_\mu}$ does not maximise this quantity over all embeddings, but it does give the
maximum over uniformly integrable (i.e. minimal) embeddings.




Figure 1: For each $x$, the value of $\beta(x)$ is determined by finding the tangent line to $C_\mu$ originating
at $x$: $\beta(x)$ is the horizontal co-ordinate of the point of contact between the tangent line and $C_\mu$.
(If $C_\mu$ includes a straight line section then this point of contact may not be uniquely defined, in
which case we take $\beta(x)$ to be the largest value of the horizontal co-ordinate at which contact
occurs.) The stopping time associated to this construction is given by the first time that an
excursion from the maximum crosses below $\beta$.



Let $b = b_\mu$ be the right-continuous inverse to $\beta$. Then $b$ is the barycentre function and for
$x < \bar{x}_\mu$, $b(x)$ is given by

$$b(x) = \mathbb{E}[X_\mu \mid X_\mu \ge x]. \tag{3.3}$$

The barycentre $b(x)$ is defined up to the upper limit of the support of $\mu$ and is a non-negative,
non-decreasing function with $b(x) \ge x$. We set $b(x) = x$ for $x \ge \bar{x}_\mu$. (The reverse barycentre,
$x \mapsto \mathbb{E}[X_\mu \mid X_\mu \le x]$, is defined analogously to the barycentre.)



It is more standard to define the barycentre function as in (3.3) and to set $\beta$ to be the inverse
barycentre function, but the two approaches are equivalent, and our approach via potentials
allows for a unified treatment with the Perkins construction in the next section.

If $\mu$ has an interval with no mass, then $b$ is constant over that interval and $\beta$ has a jump.
If $\mu$ has an atom at $x$ then $b$ has a jump at $x$ (unless the atom is at the upper limit $\bar{x}$ of the
support of $\mu$, in which case $b(\bar{x}) = \bar{x}$) and $\beta$ is constant over a range of $s$. From the definition
of $\tau_\beta$, see (3.2), and excursion theory (see Rogers [19, Equation 2.13]), we have

$$\mathbb{P}(S_{\tau_\beta} \ge s) = \exp\left(-\int_0^s \frac{dr}{r - \beta(r)}\right) \tag{3.4}$$

and then also $\mathbb{P}(S_{\tau_\beta} \ge s) = \mathbb{P}(W_{\tau_\beta} \ge \beta(s)) = \mu([\beta(s), \infty))$. Note that it does not matter which
convention we use for $\beta(s)$ here since $\mu$ places no mass on $(\beta(s-), \beta(s+))$.

Example 3.1. If $\mu = U[-1, 1]$, then $C_\mu(x) = (x - 1)^2/4$ and $P_\mu(x) = (x + 1)^2/4$ (at least
for $-1 = \underline{x}_\mu \le x \le \bar{x}_\mu = 1$). Then the barycentre function is given by $b(x) = (x + 1)/2$ for
$-1 \le x \le 1$ and hence $\beta(s) = 2s - 1$ for $0 \le s \le 1$. It follows that $S_{\tau^{AY}} = b(W_{\tau^{AY}})$ is
uniformly distributed on $[0, 1]$.
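The closed form $\beta(s) = 2s - 1$ in Example 3.1 can be compared with a direct grid search for the minimiser in (3.1). The sketch below is only an illustration: the function names, the grid size and the restriction of the search to $y \in [-1, x)$ are assumptions made here (for $y < -1$ the ratio $C(y)/(x - y)$ is larger than its value at $y = -1$, so the restriction loses nothing for the uniform law).

```python
def call_uniform(y):
    """C(y) = E[(X - y)^+] for X ~ U[-1, 1]."""
    if y <= -1.0:
        return -y
    if y >= 1.0:
        return 0.0
    return (1.0 - y) ** 2 / 4.0


def beta_numeric(x, grid_size=200_000):
    """Grid approximation of argmin_{y < x} C(y) / (x - y) over y in [-1, x)."""
    best_y, best_val = None, float("inf")
    for k in range(grid_size):
        y = -1.0 + (x + 1.0) * k / grid_size  # y runs over [-1, x)
        val = call_uniform(y) / (x - y)
        if val < best_val:
            best_y, best_val = y, val
    return best_y


if __name__ == "__main__":
    for s in (0.25, 0.5, 0.75):
        print(s, beta_numeric(s), 2.0 * s - 1.0)
```

The grid minimiser agrees with $2s - 1$ up to the grid resolution.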

Lemma 3.2. If $\mu$ places mass on $(x, \infty)$ then $(r - \beta(r))^{-1}$ is integrable over $[0, x]$.

Proof. This follows immediately from (3.4) and $\mathbb{P}(S_{\tau_\beta} > x) \ge \mathbb{P}(W_{\tau_\beta} > x) > 0$. □



3.2 The Perkins solution 

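For $x > 0$ define $\alpha^+ = \alpha^+_\mu : \mathbb{R}_+ \to \mathbb{R}_-$ by

$$\alpha^+(x) = \operatorname{argmin}_{y < 0} \frac{C_\mu(x) - P_\mu(y)}{x - y}, \tag{3.5}$$

and for $x < 0$ define $\alpha^- = \alpha^-_\mu : \mathbb{R}_- \to \mathbb{R}_+$ by

$$\alpha^-(x) = \operatorname{argmax}_{y > 0} \frac{P_\mu(x) - C_\mu(y)}{y - x}. \tag{3.6}$$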

Then $\alpha^\pm$ are monotonic functions. If the argmin (or the argmax) is not uniquely defined we take
the largest value (in modulus) for which the minimum is attained; in this way $\alpha^\pm : \mathbb{R}_\pm \to \mathbb{R}_\mp$
is right-continuous. Again, none of the subsequent analysis will depend on this convention. For
convenience we will sometimes write $\alpha$ as shorthand for $\alpha^\pm$.

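If $P_\mu$ (respectively $C_\mu$) is differentiable at $\alpha^+(x)$ (respectively $\alpha^-(x)$), then $\alpha^+(x)$
(respectively $\alpha^-(x)$) satisfies

$$\frac{C_\mu(x) - P_\mu(\alpha^+(x))}{x - \alpha^+(x)} = P'_\mu(\alpha^+(x)) \tag{3.7}$$

(respectively $P_\mu(x) - C_\mu(\alpha^-(x)) = C'_\mu(\alpha^-(x))(x - \alpha^-(x))$).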

Let $a^\pm$ be the inverse to $\alpha^\pm$ and let $a(w) = a^-(w)$ for $w > 0$ and $a(w) = a^+(w)$ for $w < 0$. Let
$I$ be the infimum process for $W$ so that $I_t = \inf_{s \le t} W_s$.

For a pair of monotonic functions $\alpha^+ : \mathbb{R}_+ \to \mathbb{R}_-$ (non-increasing) and $\alpha^- : \mathbb{R}_- \to \mathbb{R}_+$
(non-decreasing) define the stopping time

$$\tau_\alpha = \inf\{u : W_u \le \alpha^+(S_u)\} \wedge \inf\{u : W_u \ge \alpha^-(I_u)\}.$$

Suppose $\mu$ does not have an atom at zero. Then the Perkins [16] embedding $\tau^P = \tau^P_\mu$ is
given by $\tau^P = \tau_{\alpha_\mu}$.

If $\mu$ has an atom at zero, then we use independent randomisation to set $\tau^P = 0$ with
probability $\mu(\{0\})$; and otherwise $\tau^P = \tau_{\alpha_\mu}$. More precisely, in the case where $\mu$ has an atom
at zero we set the Perkins embedding to be

$$\tau^P = \begin{cases} 0 & \text{if } Z \le \mu(\{0\}), \\ \tau_{\alpha_\mu} & \text{if } Z > \mu(\{0\}), \end{cases}$$

where $Z$ is a uniform random variable which is measurable with respect to $\mathcal{F}_0$. Here $\alpha_\mu$ are
the quantities defined in (3.5) and (3.6) for $\mu$. Note that if $\mu^*$ is obtained from $\mu$ by removing
any mass at zero, and rescaling to give a probability measure, then although $C_{\mu^*}$ and $P_{\mu^*}$ are
scalar multiples of $C_\mu$ and $P_\mu$ respectively, nonetheless we have $\alpha_{\mu^*} = \alpha_\mu$.

Note that if $\mu$ has an atom at zero then we need $\mathcal{F}_0$ to be non-trivial in order to be able
to define the Perkins embedding. Note further that since there are potentially many uniform
random variables $Z$ which are measurable with respect to $\mathcal{F}_0$, if $\mu(\{0\}) > 0$ then the Perkins
embedding is not unique. Sometimes it is convenient to think about the Perkins embedding
associated with an identified $\mathcal{F}_0$ random variable $Z$, in which case we write $\tau^{P,Z}$ instead of just
$\tau^P$.

Figure 2: For $x > 0$, $a^-(x)$ is the horizontal co-ordinate of the point where the tangent line to
$C$ emanating from $b(x)$ intersects with $P$. (Alternatively, we may consider the inverse $\alpha^-$ of $a^-$:
for $y < 0$, $\alpha^-(y) > 0$ is the horizontal co-ordinate of the point such that the tangent to $C$ at
$\alpha^-(y)$ crosses $P$ at $(y, P(y))$.) These definitions extend naturally to the case where the convex
function $C$ has kinks or straight-line segments. Similarly, $a^+(z)$ is found by drawing tangents
to $P$ emanating from the reverse barycentre function evaluated at $z < 0$ and determining
intersection points with $C$. The stopping rule associated with this construction is to stop the
Brownian motion when its running maximum or minimum exit the region determined by $\alpha^+$
and $\alpha^-$.

The results of Perkins [16] show that $\tau^P \in \mathcal{S}_{UI}(W, \mu)$ and moreover, for $F$ increasing,
$\tau^P$ minimises $\mathbb{E}[F(S_\tau)]$ over $\tau \in \mathcal{S}(W, \mu)$, and not just $\mathcal{S}_{UI}(W, \mu)$ (Perkins [16], although the
representation via (3.5) and (3.6) is due to Hobson and Pedersen [10]).



Example 3.3. If $\mu = U[-1, 1]$ then $P = P_\mu$ and $C = C_\mu$ are as given in Example 3.1 and,
from (3.7), $\alpha^+(s)$ is the unique root of the equation $P'(\alpha)(s - \alpha) = C(s) - P(\alpha)$. It is easily
verified that this root is given by $\alpha^+(s) = s - 2\sqrt{s}$. Similarly, $\alpha^-(i) = i + 2\sqrt{|i|}$. It can be
shown that $\mathbb{P}(S_{\tau_\alpha} \ge s) = \mathbb{P}(W_{\tau_\alpha} \ge s) + \mathbb{P}(W_{\tau_\alpha} \le s - 2\sqrt{s}) = 1 - \sqrt{s}$.
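The boundaries in Example 3.3 can be checked directly against the tangency conditions. The sketch below evaluates the residual of $P'(\alpha)(s - \alpha) = C(s) - P(\alpha)$ at $\alpha = s - 2\sqrt{s}$, the analogous residual for $\alpha^-$, and the law of the maximum obtained by adding the mass of $U[-1,1]$ above $s$ to the mass below $\alpha^+(s)$. All function names are hypothetical, and the formulas for $C$ and $P$ are only used on $[-1, 1]$.

```python
import math


def C(x):
    """C(x) = (1 - x)^2 / 4 for U[-1, 1], valid for -1 <= x <= 1."""
    return (1.0 - x) ** 2 / 4.0


def P(x):
    """P(x) = (1 + x)^2 / 4 for U[-1, 1], valid for -1 <= x <= 1."""
    return (1.0 + x) ** 2 / 4.0


def dC(x):
    return -(1.0 - x) / 2.0


def dP(x):
    return (1.0 + x) / 2.0


def alpha_plus(s):
    return s - 2.0 * math.sqrt(s)


def alpha_minus(i):
    return i + 2.0 * math.sqrt(abs(i))


def tangency_plus(s):
    """Residual of P'(a)(s - a) = C(s) - P(a) at a = alpha_plus(s)."""
    a = alpha_plus(s)
    return dP(a) * (s - a) - (C(s) - P(a))


def tangency_minus(i):
    """Residual of C(a) + C'(a)(i - a) = P(i) at a = alpha_minus(i)."""
    a = alpha_minus(i)
    return C(a) + dC(a) * (i - a) - P(i)


def law_of_max(s):
    """Mass of U[-1, 1] on [s, 1] plus mass on [-1, alpha_plus(s)]."""
    return (1.0 - s) / 2.0 + (alpha_plus(s) + 1.0) / 2.0


if __name__ == "__main__":
    for s in (0.1, 0.4, 0.9):
        print(s, tangency_plus(s), tangency_minus(-s), law_of_max(s), 1.0 - math.sqrt(s))
```

Both residuals vanish up to rounding, and the two masses sum to $1 - \sqrt{s}$.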

Example 3.4. Notwithstanding the above example, in general it is difficult to derive an explicit 
form for the stopping boundary associated with the Perkins stopping time. Here we give a second 
example where analytic expressions, albeit complicated ones, can be derived. 

Suppose the target law is a centred Pareto distribution with support $[-1, \infty)$ and density
function $f(x) = 2(x + 2)^{-3}$. Then for $k \ge -1$, $C(k) = (2 + k)^{-1}$ and $P(k) = k + (2 + k)^{-1}$, and
for $k < -1$, $C(k) = -k$, $P(k) = 0$.

Then, for the Azema-Yor embedding, $\beta$ solves $C(\beta) = (s - \beta)|C'(\beta)|$ and $\beta(s) = (s/2) - 1$.

For the Perkins embedding, $\alpha^+(s)$ solves $P'(\alpha^+) = (C(s) - P(\alpha^+))/(s - \alpha^+)$ and we have
(after some algebra)

$$\alpha^+(s) = \frac{-2s^2 - 5s + \sqrt{s^4 + 6s^3 + 12s^2 + 8s}}{2s - 1 + s^2}.$$

The expression for $\alpha^-$ is

$$\alpha^-(i) = \frac{-3i - 2i^2 + \sqrt{-(i^4 + 6i^3 + 12i^2 + 8i)}}{1 + 2i + i^2}.$$

If $\mu$ has an interval in $\mathbb{R}_+$ (respectively $\mathbb{R}_-$) with no mass, then $\alpha^-$ (respectively $\alpha^+$) has a
jump (unless that interval is contiguous with zero, in which case the function starts at a non-zero value).
If $\mu$ has an atom in $(0, \infty)$ (respectively $(-\infty, 0)$) then $\alpha^-$ (respectively $\alpha^+$) is constant over a
range of values.
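The closed forms for the Pareto law of Example 3.4 can be sanity-checked by substituting them back into the equations they are meant to solve. The sketch below does this numerically; the function names are hypothetical, the denominator of $\alpha^-$ is taken to be $(1 + i)^2$, and the test points avoid $s = \sqrt{2} - 1$, where the expression for $\alpha^+$ is of the form $0/0$.

```python
import math


def C(k):
    """C(k) = 1 / (2 + k), valid for k >= -1."""
    return 1.0 / (2.0 + k)


def P(k):
    """P(k) = k + 1 / (2 + k), valid for k >= -1."""
    return k + 1.0 / (2.0 + k)


def dC(k):
    return -1.0 / (2.0 + k) ** 2


def dP(k):
    return 1.0 - 1.0 / (2.0 + k) ** 2


def beta(s):
    return s / 2.0 - 1.0


def alpha_plus(s):
    root = math.sqrt(s ** 4 + 6.0 * s ** 3 + 12.0 * s ** 2 + 8.0 * s)
    return (-2.0 * s ** 2 - 5.0 * s + root) / (s ** 2 + 2.0 * s - 1.0)


def alpha_minus(i):
    root = math.sqrt(-(i ** 4 + 6.0 * i ** 3 + 12.0 * i ** 2 + 8.0 * i))
    return (-3.0 * i - 2.0 * i ** 2 + root) / (1.0 + 2.0 * i + i ** 2)


def residual_beta(s):
    """C(beta) - (s - beta) |C'(beta)|, zero for the Azema-Yor boundary."""
    b = beta(s)
    return C(b) - (s - b) * abs(dC(b))


def residual_alpha_plus(s):
    """P'(a)(s - a) - (C(s) - P(a)), zero when the tangent to P at a hits (s, C(s))."""
    a = alpha_plus(s)
    return dP(a) * (s - a) - (C(s) - P(a))


def residual_alpha_minus(i):
    """C(a) + C'(a)(i - a) - P(i), zero when the tangent to C at a hits (i, P(i))."""
    a = alpha_minus(i)
    return C(a) + dC(a) * (i - a) - P(i)


if __name__ == "__main__":
    for s in (0.2, 1.0, 3.0):
        print(s, beta(s), alpha_plus(s), residual_beta(s), residual_alpha_plus(s))
    for i in (-0.2, -0.5, -0.8):
        print(i, alpha_minus(i), residual_alpha_minus(i))
```

The residuals are zero up to floating-point error at the sampled points, and $\alpha^+(s)$ stays in $(-1, 0)$, inside the support.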

Lemma 3.5. Suppose $x > 0$. If $\mu$ places mass on $[x, \infty)$ then $(r - \alpha^+(r))^{-1}$ is integrable over
$(0, x)$.

Proof. We have $(W_u > \alpha^+(S_u);\ \forall u \le H_x) \supseteq (\tau_\alpha \ge H_x) \supseteq (W_{\tau_\alpha} \ge x)$ and then by excursion
theory, recall (3.4),

$$\exp\left(-\int_0^x \frac{dr}{r - \alpha^+(r)}\right) = \mathbb{P}(W_u > \alpha^+(S_u);\ \forall u \le H_x) \ge \mu([x, \infty)) > 0.$$

□



4 Convergence of measures and convergence of embeddings 

Let $(\mu_n)_{n \ge 1}$ be a sequence of measures and write $U_n$, $\beta_n$ and $\alpha_n$ as shorthand for $U_{\mu_n}$, $\beta_{\mu_n}$ and
$\alpha_{\mu_n}$, with a similar convention for other functionals.

Suppose that, for each $n$, $\mu_n$ is centred and that $(\mu_n)_{n \ge 1}$ converges weakly to $\mu$, where $\mu$
is also centred. Then it does not follow that $U_n \to U_\mu$, nor that $\beta_n \to \beta_\mu$, nor that $\alpha_n \to \alpha_\mu$.
However, with the correct additional hypotheses, these types of convergence are equivalent.

Our first key result is the following. 

Proposition 4.1. Let $(\mu_n)$ be a sequence of measures such that $\mu_n \Rightarrow \mu$ and $\mathbb{E}[|X_{\mu_n}|] \to \mathbb{E}[|X_\mu|]$.
Then $b_n(x) \to b(x)$ at continuity points $x < \bar{x}$ of $b$.

Proof. Chacon [3] shows that if $\mu_n \Rightarrow \mu$ and $U_n(0) \to U(0)$ then $U_n \to U$ pointwise. It follows
trivially that $C_n \to C$ pointwise, where $C_n(x) = C_{\mu_n}(x)$ and $C(x) = C_\mu(x)$.

Recall that $x$ is a discontinuity point of $b$ if and only if there is an atom of $\mu$ at $x$. Suppose
$x < \bar{x}$ is a continuity point of $b$. Then (3.3) gives $b(x) = x + C(x)/\mu([x, \infty))$ and

$$b_n(x) = x + \frac{C_n(x)}{\mu_n([x, \infty))} \to x + \frac{C(x)}{\mu([x, \infty))} = b(x).$$

□

Corollary 4.2. Let $(\mu_n)$ be a sequence of measures such that $\mu_n \Rightarrow \mu$ and $\mathbb{E}[|X_{\mu_n}|] \to \mathbb{E}[|X_\mu|]$.
Then $\beta_n(s) \to \beta(s)$ at continuity points $s < \bar{x}$ of $\beta$. Moreover, if $\bar{x} < \infty$ then for each $z > \bar{x}$,
$\liminf \beta_n(z) \ge \bar{x}$.

Proof. Since $b_n(\bar{x} - \varepsilon) < \bar{x} + \varepsilon$ for sufficiently large $n$ we have for these same $n$ that
$\beta_n(\bar{x} + \varepsilon) \ge \bar{x} - \varepsilon$. □



Corollary 4.3. Under the assumptions of Proposition 4.1, $\tau_{\beta_n} \to \tau_\beta$ almost surely.

Proof. Let $D$ be the set of discontinuity points of $\beta$. If $S_{\tau_\beta} \notin D$ then $W_{\tau_\beta} = \beta(S_{\tau_\beta})$ and it
follows that

$$(\omega : \tau_{\beta_n} \not\to \tau_\beta) \subset (\omega : S_{\tau_\beta} \in D) \cup (\omega : S_{\tau_\beta} \notin D,\ W_{\tau_\beta} = \beta(S_{\tau_\beta}),\ \tau_{\beta_n} \not\to \tau_\beta).$$

For any stopping time $\sigma$ write $H^\sigma_x = \inf\{u \ge \sigma : W_u = x\}$.

Case 1: $\bar{x} = \infty$.

Note that since $\beta$ is increasing, $D$ is countable and $\mathbb{P}(S_{\tau_\beta} \in D) = 0$.

First we argue that on $(\omega : S_{\tau_\beta} = x)$ we have that for sufficiently large $n$, $S_{\tau_{\beta_n}} \ge x$: since
there are only countably many values of $s < x$ on which the value of $W_u$ gets below $S_u - s$,
and on each of these excursions $W$ stays above $\beta(S)$, for sufficiently large $n$, $W$ must stay above
$\beta_n(S)$ also.

Hence $\liminf_n S_{\tau_{\beta_n}} \ge S_{\tau_\beta}$ almost surely. Then on $\{\omega : S_{\tau_\beta} = x \notin D,\ W_{\tau_\beta} = \beta(x)\}$, we have
$\tau_{\beta_n}(\omega) \to \tau_\beta(\omega)$ unless $\inf\{W_u : \tau_\beta \le u \le H^{\tau_\beta}_x\} = W_{\tau_\beta} = \beta(x)$ and $\beta_n(x) < \beta(x)$. But, almost
surely, on any interval of positive length Brownian motion goes below its starting value. In
particular, the set $(\omega : S_{\tau_\beta} \notin D,\ W_{\tau_\beta} = \beta(S_{\tau_\beta}),\ \tau_{\beta_n} \not\to \tau_\beta)$ has probability zero.
Case 2: $\bar{x} < \infty$ and $\mu(\{\bar{x}\}) = 0$.

The only paths for which issues of convergence might be different to the previous case are those
for which $S_{\tau_\beta} = \bar{x}$. But since $\mu$ has no atom at $\bar{x}$, $\mathbb{P}(S_{\tau_\beta} = \bar{x}) = \mathbb{P}(W_{\tau_\beta} = \bar{x}) = 0$ and $\tau_{\beta_n} \to \tau_\beta$
almost surely.

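Case 3: $\bar{x} < \infty$ and $\mu(\{\bar{x}\}) > 0$.

In this case $\beta(\bar{x}-) := \lim_{y \uparrow \bar{x}} \beta(y) < \beta(\bar{x}) = \bar{x}$. We show that on the set $(S_{\tau_\beta} = \bar{x})$ we have
$\lim \tau_{\beta_n} = \tau_\beta$, almost surely. Off the set $(S_{\tau_\beta} = \bar{x})$ convergence follows exactly as in the previous
cases.

First we argue that $\limsup_n S_{\tau_{\beta_n}} \le \bar{x}$ almost surely. Fix $z > \bar{x}$, then given $0 < \varepsilon < z - \bar{x}$,
there exists $N$ such that for $n \ge N$, $\beta_n(\bar{x} + \varepsilon) \ge \bar{x} - \varepsilon$. Hence, for sufficiently large $n$,

$$(\omega : S_{\tau_{\beta_n}}(\omega) \ge z) \subset (\omega : \inf\{W_u : H_{\bar{x}+\varepsilon} \le u \le H_z\} \ge \bar{x} - \varepsilon).$$

But

$$\mathbb{P}\left(\inf\{W_u : H_{\bar{x}+\varepsilon} \le u \le H_z\} \ge \bar{x} - \varepsilon\right) \le \exp\left(-\int_{\bar{x}+\varepsilon}^{z} \frac{dy}{y - (\bar{x} - \varepsilon)}\right) = \frac{2\varepsilon}{z - \bar{x} + \varepsilon}.$$

By choosing $\varepsilon$ small compared with $(z - \bar{x})$ we deduce that $\limsup_n S_{\tau_{\beta_n}} < z$ for any $z > \bar{x}$.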

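Now we argue that on $S_{\tau_\beta} = \bar{x}$ we have $\liminf W_{\tau_{\beta_n}} \ge \bar{x}$ almost surely. Coupled with the
result from the previous paragraph we can then conclude that on $W_{\tau_\beta} = \bar{x}$ we have $\tau_{\beta_n} \to \tau_\beta = H_{\bar{x}}$.

Given $\delta$ and $\varepsilon < \bar{x} - \beta(\bar{x}-) - \delta$, there exists $N$ such that for all $n \ge N$, $\beta_n(\bar{x} - \varepsilon) <
\beta(\bar{x}-) + \varepsilon < \bar{x} - \delta$. Then

$$(\omega : W_{\tau_{\beta_n}}(\omega) < \bar{x} - \delta,\ S_{\tau_\beta}(\omega) = \bar{x}) \subset (\omega : \inf\{W_u : H_{\bar{x}-\varepsilon} \le u \le H_{\bar{x}}\} < \bar{x} - \delta) \cup (\omega : S_{\tau_{\beta_n}} < \bar{x} - \varepsilon,\ S_{\tau_\beta} = \bar{x}).$$

By similar arguments to those in Case 1 we can prove that the final event has small probability.
Moreover, using the fact that the probability that an event occurs is smaller than the
expected number of times that it occurs,

$$\mathbb{P}\left(\omega : \inf\{W_u : H_{\bar{x}-\varepsilon} \le u \le H_{\bar{x}}\} < \bar{x} - \delta\right) \le \int_{\bar{x}-\varepsilon}^{\bar{x}} \frac{dy}{y - (\bar{x} - \delta)} = \ln\left(\frac{\delta}{\delta - \varepsilon}\right).$$

By choosing $\varepsilon$ small compared to $\delta$ this probability can be made arbitrarily small. □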

Note that if $\tau_{\beta_n} \to \tau_\beta$ almost surely, then by the continuity of Brownian motion $W_{\tau_{\beta_n}} \to W_{\tau_\beta}$
almost surely and $\mu_n \Rightarrow \mu$.

We can summarise the results as follows: 

Proposition 4.4. Suppose that $(\mu_n)_{n \ge 1}$ and $\mu$ are centred and that $\mathbb{E}[|X_{\mu_n}|] \to \mathbb{E}[|X_\mu|]$. Then
the following are equivalent:

(i) $\mu_n \Rightarrow \mu$ and $\mathbb{E}[|X_{\mu_n}|] \to \mathbb{E}[|X_\mu|]$;

(ii) $U_n(x) \to U_\mu(x)$ for each $x \in \mathbb{R}$;

(iii) $\beta_n \to \beta$ at continuity points $s$ of $\beta$, provided $s$ is less than or equal to the upper limit on
the support of $\mu$;

(iv) $\tau^{AY}_{\mu_n} \to \tau^{AY}_\mu$ almost surely;

(v) $W_{\tau^{AY}_{\mu_n}} \to W_{\tau^{AY}_\mu}$ almost surely.

Now we want to prove a similar result for the Perkins embedding. 

Lemma 4.5. Let $(\mu_n)_{n \ge 1}$ be a sequence of centred probability measures such that $\mu_n \Rightarrow \mu$ and
$\mathbb{E}[|X_{\mu_n}|] \to \mathbb{E}[|X_\mu|]$. Then $a^\pm_n(x) \to a^\pm(x)$ at continuity points $x \in (\underline{x}, \bar{x}) \setminus \{0\}$ of $a$. Moreover
$\alpha^\pm_n(x) \to \alpha^\pm(x)$ at non-zero continuity points $\underline{x} < x < \bar{x}$ of $\alpha$.

Proof. We prove the result for $(a^+_n, a^+)$, the other case being similar. The extension from
$a$ to $\alpha$ follows as in Corollary 4.2.



Again we have that $x < 0$ is a discontinuity point of $a^+$ if and only if there is an atom of $\mu$
at $x$. Suppose that $x$ is not an atom of $\mu$. Then, recall (3.7), $a^+(x)$ is the unique solution in $z$
of $P(x) + P'(x)(z - x) = C(z)$. Moreover, for any $a_n(x) \in (a^+_n(x+), a^+_n(x-))$,

$$P_n(x) + P'_n(x+)(a_n(x) - x) \ge C_n(a_n(x)),$$
$$P_n(x) + P'_n(x-)(a_n(x) - x) \le C_n(a_n(x)).$$

Suppose $a^+_n(x) \to \gamma$ (down a subsequence if necessary). Then, since $P'_n(x\pm) \to P'(x)$,

$$P(x) + P'(x)(\gamma - x) \ge C(\gamma) \ge P(x) + P'(x)(\gamma - x).$$

Hence $\gamma = a^+(x)$ and $a^+_n(x) \to a(x)$. □

Proposition 4.6. Suppose that $(\mu_n)_{n \ge 1}$ and $\mu$ are centred and that $\mathbb{E}[|X_{\mu_n}|] \to \mathbb{E}[|X_\mu|]$.

(a) Suppose there exists an open interval $I$ containing 0 such that $\mu_n(I) = \mu(I) = 0$. Then
the following are equivalent:

(i) $\mu_n \Rightarrow \mu$ and $\mathbb{E}[|X_{\mu_n}|] \to \mathbb{E}[|X_\mu|]$;

(ii) $U_n(x) \to U_\mu(x)$ for each $x \in \mathbb{R}$;

(iii) $\alpha^\pm_n \to \alpha^\pm$ at continuity points of $\alpha^\pm$ which lie within the range of the support of $\mu$;

(iv) $\tau^P_{\mu_n} \to \tau^P_\mu$ almost surely;

(v) $W_{\tau^P_{\mu_n}} \to W_{\tau^P_\mu}$ almost surely.

(b) More generally, suppose $\mu_n \Rightarrow \mu$ and $\mathbb{E}[|X_{\mu_n}|] \to \mathbb{E}[|X_\mu|]$. Then $\alpha^\pm_n \to \alpha^\pm$ at continuity
points of $\alpha^\pm$ which lie within the range of the support of $\mu$.

Suppose further that $\mu_n(\{0\}) \to \mu(\{0\})$. Then there exists a sequence of Perkins embeddings
$\tau^{P,Z_n}_{\mu_n}$ of $\mu_n$ such that $\tau^{P,Z_n}_{\mu_n}$ converges in probability to a Perkins embedding of $\mu$. In
particular, if $Z_n$ converges in probability to $Z$ then the Perkins embeddings $(\tau^{P,Z_n}_{\mu_n})_{n \ge 1}$
converge in probability to the Perkins embedding $\tau^{P,Z}_\mu$ of $\mu$.

Thus, if $\mu_n \Rightarrow \mu$, $\mathbb{E}[|X_{\mu_n}|] \to \mathbb{E}[|X_\mu|]$ and $\mu_n(\{0\}) \to \mu(\{0\})$ then if $(\tau^{P,Z_n}_{\mu_n})_{n \ge 1}$ is a
sequence of Perkins embeddings of $(\mu_n)_{n \ge 1}$ then there exists a subsequence $n_k$ along which
$\lim_k \tau^{P,Z_{n_k}}_{\mu_{n_k}}$ exists almost surely, and is a Perkins embedding of $\mu$.

Proof. For Part (a) the equivalence of (i) and (ii) follows as before. Lemma 4.5 gives that (ii)
implies (iii). It follows from the pathwise construction of $\tau_{\alpha_n}$ (and the existence of the interval
$I$ which is not charged by $\mu_n$, so that $\tau^P_{\mu_n} = \tau_{\alpha_n}$) that $\tau^P_{\mu_n} \to \tau^P_\mu$ almost surely and hence we
have (iii) implies (iv). The continuity of Brownian motion allows us to deduce (v), from which
(i) follows immediately.

For Part (b) the statement about the convergence of $\alpha^\pm_n$ follows as before. For the other
results, suppose first that $\mu(\{0\}) = 0$ and $\mu_n(\{0\}) = 0$ for all sufficiently large $n$. Recall that
$\tau_\alpha = \inf\{u : W_u \le \alpha^+(S_u) \text{ or } W_u \ge \alpha^-(I_u)\}$ and for $\eta > 0$ define the stopping time

$$\rho_{\alpha,\eta} = \tau_{\alpha_\eta}$$

where $\alpha^+_\eta(s) = \min\{\alpha^+(s), -\eta\}$ and $\alpha^-_\eta(i) = \max\{\alpha^-(i), \eta\}$.

We have that $\alpha_n \to \alpha$ at non-zero continuity points. Then, with $\alpha^\pm_{n,\eta} = \mp\max\{\mp\alpha^\pm_n(s), \eta\}$,
$\alpha_{n,\eta} \to \alpha_\eta$ at continuity points and by the results of Part (a), $\rho_{\alpha_n,\eta} \to \rho_{\alpha,\eta}$ almost surely. In
particular, given $\delta, \varepsilon > 0$ there exists $N_0$ such that for all $n \ge N_0$

$$\mathbb{P}(|\rho_{\alpha_n,\eta} - \rho_{\alpha,\eta}| > \varepsilon) < \delta/2.$$

Note that on $|W_{\tau_\alpha}| > \eta$ we have $\rho_{\alpha,\eta} = \tau_\alpha$ with a similar statement for $\alpha_n$. We can choose
$\eta > 0$ so that $\mu([-2\eta, 2\eta]) < \delta/6$ and then $N_1$ so that for $n \ge N_1$, $\mu_n([-\eta, \eta]) < \delta/3$. Then

$$(|\tau_{\alpha_n} - \tau_\alpha| > \varepsilon) \subset (|W_{\tau_\alpha}| \le \eta) \cup (|W_{\tau_{\alpha_n}}| \le \eta) \cup (|\tau_{\alpha_n} - \tau_\alpha| > \varepsilon,\ |W_{\tau_\alpha}| > \eta,\ |W_{\tau_{\alpha_n}}| > \eta)$$

and the set $(|\tau_{\alpha_n} - \tau_\alpha| > \varepsilon)$ has probability at most $\delta$.

It follows that $\tau_{\alpha_n} \to \tau_\alpha$ in probability, and hence that there is almost sure convergence
down a subsequence. Furthermore, down the same subsequence $W_{\tau_{\alpha_n}} \to W_{\tau_\alpha}$ almost surely.

Now suppose that $\mu(\{0\}) = 0$ and that $\lim \mu_n(\{0\}) = 0$. Recall the definition of $\mu^*_n$ as
the measure $\mu_n$ with probability mass at zero removed, and then rescaled to be a probability
measure, and note that $\alpha_{\mu^*_n} = \alpha_{\mu_n}$. Then also $\mu^*_n \Rightarrow \mu$ and $U_{\mu^*_n} \to U_\mu$ pointwise.

Then, $\tau^{P,Z_n}_{\mu_n} = 0$ for $Z_n \le \mu_n(\{0\})$ and $\tau^{P,Z_n}_{\mu_n} = \tau_{\alpha_n}$ otherwise, so that $\tau^{P,Z_n}_{\mu_n} \to \tau_\alpha$ in
probability. Moreover, down a subsequence, $\tau^{P,Z_n}_{\mu_n} \to \tau_\alpha$ almost surely.

It remains to consider the case where $\mu(\{0\}) > 0$. For $\varepsilon < 1$, writing $A_n = (Z_n \le
\mu_n(\{0\}),\ Z > \mu(\{0\}))$ and $B_n = (Z_n > \mu_n(\{0\}),\ Z \le \mu(\{0\}))$,

$$(|\tau^{P,Z_n}_{\mu_n} - \tau^{P,Z}_\mu| > \varepsilon) \subset A_n \cup B_n \cup (Z_n > \mu_n(\{0\}),\ Z > \mu(\{0\}),\ |\tau_{\alpha_n} - \tau_\alpha| > \varepsilon)$$

and $\tau^{P,Z_n}_{\mu_n} \to \tau^{P,Z}_\mu$ in probability. As before, there is almost sure convergence down a
subsequence. □

Remark 4.7. One easy and natural way to guarantee that $Z_n \to Z$ is to take $Z_n = Z$ with
probability one, or in other words to use the same independent randomisation variable for each
embedding.

Remark 4.8. Suppose that $\mu$ is less than or equal to $\nu$ in convex order (we write $\mu \le_{cx} \nu$). Then
$U_\mu \le U_\nu$. However, it does not follow that $\beta_\mu \ge \beta_\nu$, and so it does not follow that $\tau^{AY}_\mu \le \tau^{AY}_\nu$.
Similarly, we do not have that $|\alpha_\mu| \le |\alpha_\nu|$ nor $\tau^P_\mu \le \tau^P_\nu$.

Nonetheless, given $\mu$ it is possible to choose $\mu_n$ increasing to $\mu$ in convex order and such that
the barycentres are decreasing and hence the stopping times $\tau^{AY}_{\mu_n}$ are monotonically increasing
and converge to $\tau^{AY}_\mu$. This idea is used extensively in Azema and Yor [2], see also Revuz and
Yor [17, Section VI.5], and also below in the proof of Theorem 7.1.
Similar remarks apply for the Perkins embedding.

Example 4.9. In Proposition 4.4 it does not hold that $\beta_n(s) \to \beta(s)$ for $s$ beyond the upper
limit on the support of $\mu$.

Suppose $\mu = \frac{1}{2}(\delta_1 + \delta_{-1})$ and $\mu_n = (1 - n^{-2})\frac{1}{2}(\delta_1 + \delta_{-1}) + n^{-2}\frac{1}{2}(\delta_n + \delta_{-n})$. Then $U_\mu(0) = 1$
and $U_n(0) = 1 + n^{-1} - n^{-2} \to 1$.

We have $b_n$ is piecewise constant, and $b_n(x) = 0$ for $x \le -n$, $b_n(x) = n/(2n^2 - 1)$ for
$-n < x \le -1$, $b_n(x) = 1 + n^{-1} - n^{-2}$ for $-1 < x \le 1$ and $b_n(x) = n$ for $1 < x \le n$. Then
$\beta_n(s) \to \beta_\infty(s)$ where $\beta_\infty(s) = -1$ for $s < 1$ and $\beta_\infty(s) = 1$ for $s > 1$. In contrast, $\beta(s) = -1$
for $s < 1$ and $\beta(s) = s$ for $s \ge 1$.
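The potential at zero and the barycentre values quoted in Example 4.9 can be reproduced exactly for any fixed $n \ge 2$. The sketch below builds $\mu_n$ as a dictionary of atoms and evaluates $U_n(0)$ and $b_n(x) = \mathbb{E}[X \mid X \ge x]$ with rational arithmetic; the function names are hypothetical, and $n = 1$ is excluded because the atoms at $\pm 1$ and $\pm n$ would then coincide.

```python
from fractions import Fraction


def mu_n(n):
    """Atoms of mu_n for an integer n >= 2, stored as {atom: probability}."""
    n = Fraction(n)
    w = 1 / n ** 2
    return {
        Fraction(-1): (1 - w) / 2,
        Fraction(1): (1 - w) / 2,
        n: w / 2,
        -n: w / 2,
    }


def potential_at_zero(law):
    """U(0) = E|X|."""
    return sum(p * abs(v) for v, p in law.items())


def barycentre(law, x):
    """b(x) = E[X | X >= x]; x must not exceed the largest atom."""
    mass = sum(p for v, p in law.items() if v >= x)
    return sum(p * v for v, p in law.items() if v >= x) / mass


if __name__ == "__main__":
    n = 5
    law = mu_n(n)
    print("U_n(0):", potential_at_zero(law))
    for x in (-n, -3, 0, 2):
        print("b_n(%s) = %s" % (x, barycentre(law, x)))
```

For $n = 5$ the four barycentre levels are $0$, $5/49$, $29/25$ and $5$, matching the formulas above.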

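Example 4.10. If $\alpha_n \to \alpha_\mu$ but $U_{\mu_n}(0) \not\to U_\mu(0)$ then in general $\mu_n \not\Rightarrow \mu$.

Suppose $\mu = p(\delta_1 + \delta_{-1}) + (1 - 2p)\delta_0$ and $\mu_n = q(\delta_1 + \delta_{-1}) + (1 - 2q)\delta_0$. Then $\alpha_n = \alpha_\mu$ but
$\mu_n \ne \mu$ unless $p = q$.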

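Example 4.11. Suppose $\alpha_n \to \alpha_\mu$ at continuity points of $\alpha_\mu$ and $U_n(0) \to U_\mu(0)$ but $\mu_n(\{0\})$
does not tend to $\mu(\{0\})$. Then it does not follow that $\tau_{\alpha_n}$ converges almost surely, although even
then we may still have $\mu_n \Rightarrow \mu$.

Let $\mu = \frac14(\delta_1 + \delta_{-1}) + \frac12\delta_0$, and for $n > 1$ let $\mu_n$ consist of masses of size

\[ \frac{n+1}{4n}, \qquad \frac12, \qquad \frac{n-1}{4n} \]

at $\{-1, 1/n, 1\}$ respectively. Then $\alpha^{\pm}_\mu(x) = \mp 1$, $\alpha^{+}_n(x) = -1$ and $\alpha^{-}_n(x) = 1/n$ for $-1/n < x < 0$
and $\alpha^{-}_n(x) = 1$ for $x < -1/n$. Further, $\tau_\alpha = H_{\pm 1}$ and

\[ \tau_{\alpha_n} = \begin{cases}
H_{1/n} & \text{if } H_{1/n} < H_{-1/n}; \\
H_{-1} & \text{if } H_{1/n} > H_{-1/n} \text{ and } H_{-1} < H_1; \\
H_{1} & \text{if } H_{1/n} > H_{-1/n} \text{ and } H_1 < H_{-1}.
\end{cases} \]

Then, if $E_n$ is the event that $\tau_{\alpha_n} = H_{1/n}$ then $P(E_n) = 1/2$ and for $n > m$,
\[ P(E_n \mid E_m) = P(E_m \mid E_n) = P^{1/n}(H_{1/m} < H_{-1/m}) = \frac{n+m}{2n}. \]

Hence $P(E_n \cap E_m^c) = (n - m)/(4n)$, which does not tend to zero as $n \to \infty$ for fixed $m$. Hence

\[ P(|\tau_{\alpha_n} - \tau_{\alpha_m}| > \epsilon) \ge P(H_{\pm 1} - H_{\pm 1/2} > \epsilon, E_n \cap E_m^c) \not\to 0 \]
and the sequence $(\tau_{\alpha_n})_{n \ge 1}$ is not Cauchy in probability.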
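The probabilities in Example 4.11 follow from the standard exit formula for Brownian motion from an interval. The sketch below reproduces them with exact fractions; the function names are hypothetical, and the strong Markov step (restarting at $1/n$ once $E_n$ has occurred) is the only ingredient assumed beyond that formula.

```python
from fractions import Fraction


def hit_upper_first(start, lower, upper):
    """P(Brownian motion started at `start` hits `upper` before `lower`)."""
    return (Fraction(start) - lower) / (Fraction(upper) - lower)


def prob_En_not_Em(n, m):
    """P(E_n and not E_m) for n > m, where E_k = {H_{1/k} < H_{-1/k}}."""
    p_en = hit_upper_first(0, Fraction(-1, n), Fraction(1, n))
    # On E_n the path is at 1/n; E_m then requires reaching 1/m before -1/m.
    p_em_given_en = hit_upper_first(Fraction(1, n), Fraction(-1, m), Fraction(1, m))
    return p_en * (1 - p_em_given_en)


for n in (4, 8, 16, 1024):
    print(n, prob_En_not_Em(n, 2))
```

For fixed $m$ the printed values increase towards $1/4$ rather than decaying, which is the obstruction to convergence in probability noted in the example.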

Example 4.12. Suppose $\alpha_n \to \alpha_\mu$ at continuity points of $\alpha_\mu$ and $U_n(0) \to U_\mu(0)$ and $\mu_n(\{0\}) = 0
= \mu(\{0\})$. If there is no interval $I$ containing 0 on which $\mu_n(I) = 0 = \mu(I)$ then it need not
follow that $\tau_{\alpha_n} \to \tau_\alpha$ almost surely, although there is convergence in probability.
Let $\mu = U\{-1, +1\}$ and for $n \ge 2$ let $\mu_n$ consist of masses of size

\[ \frac{n(1 + 2^{-n})}{2(1+n)}, \qquad \frac{1}{1+n}, \qquad \frac{n(1 - 2^{-n})}{2(1+n)} \]

at $\{-1, n2^{-n}, 1\}$ respectively. Then $\alpha^{\pm}_\mu(x) = \mp 1$; $\alpha^{+}_n(x) = -1$ and $\alpha^{-}_n(x) = n2^{-n}$ for $-2^{-n} < x < 0$
and $\alpha^{-}_n(x) = 1$ for $x < -2^{-n}$. Further, $\tau_\alpha = H_{\pm 1}$ and

\[ \tau_{\alpha_n} = \begin{cases}
H_{n2^{-n}} & \text{if } H_{n2^{-n}} < H_{-2^{-n}}; \\
H_{-1} & \text{if } H_{n2^{-n}} > H_{-2^{-n}} \text{ and } H_{-1} < H_1; \\
H_{1} & \text{if } H_{n2^{-n}} > H_{-2^{-n}} \text{ and } H_1 < H_{-1}.
\end{cases} \]

Then, if $E_n$ is the event that $\tau_{\alpha_n} \ne \tau_\alpha$ then $P(E_n) = 1/(n+1)$ and for $n > m$,

\[ P(E_m \cap E_n) = P(E_n) P(E_m \mid E_n) = \frac{1}{1+n} \cdot \frac{n2^{-n} + 2^{-m}}{m2^{-m} + 2^{-m}} = P(E_n)P(E_m) + \frac{n 2^{m-n}}{(1+n)(1+m)}. \]

Then by the Kochen-Stone Lemma (Durrett, Exercise 1.6.19), $E_n$ happens infinitely often,
almost surely. In particular, almost surely, $\tau_{\alpha_n}$ does not converge.
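The joint probabilities used in Example 4.12 can be verified exactly in the same way. This is a minimal sketch under the reading that $E_n = \{H_{n2^{-n}} < H_{-2^{-n}}\}$; the function names are illustrative and not taken from the paper.

```python
from fractions import Fraction


def p_E(n):
    """P(E_n): exit (-2^-n, n 2^-n) from 0 through the upper end."""
    lower, upper = -Fraction(1, 2 ** n), Fraction(n, 2 ** n)
    return (0 - lower) / (upper - lower)


def p_joint(m, n):
    """P(E_m and E_n) for n > m >= 2, restarting at n 2^-n on E_n."""
    start = Fraction(n, 2 ** n)
    lower, upper = -Fraction(1, 2 ** m), Fraction(m, 2 ** m)
    return p_E(n) * (start - lower) / (upper - lower)


def correction(m, n):
    """The excess of P(E_m and E_n) over the product P(E_m) P(E_n)."""
    return Fraction(n * 2 ** m, 2 ** n) / ((1 + n) * (1 + m))


for m, n in [(2, 3), (2, 5), (4, 9)]:
    print(m, n, p_joint(m, n), p_E(m) * p_E(n) + correction(m, n))
```

Since $\sum_n P(E_n) = \sum_n 1/(n+1)$ diverges and the correction term is geometrically small in $n - m$, the events are close enough to pairwise independent for the Kochen-Stone Lemma to apply.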



5 Objective functions as terminal values 

Our goal is to prove that for a suitable class of bivariate functions $F(w,s)$, the Azema-Yor
and Perkins embeddings, which are well known to maximise and minimise $E[F(W_\tau, S_\tau)]$ in the
special case where $F$ does not depend on $w$ and $F$ is increasing in $s$, continue to optimise this
quantity even when there is non-trivial dependence on $w$.

Assumption 5.1. Throughout we assume that $F : \{(w,s) \in \mathbb{R} \times \mathbb{R}_+ ; w \le s\} \to \mathbb{R}$ is a
continuous function which is bounded on compact sets. We further assume that the partial
derivative $F_s$ exists and is continuous and bounded on compact sets.

We are interested in functions F which are monotonic in the following sense (note that in 
our terminology increasing is a synonym for non-decreasing). 

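Definition 5.2. $F$ satisfies F-MON$\uparrow$ or F-MON$\downarrow$ if

F-MON$\uparrow$: $F_s(w, s)/(s - w)$ is monotonic increasing in $w$.

F-MON$\downarrow$: $F_s(w, s)/(s - w)$ is monotonic decreasing in $w$.

For $r < \bar{x} < \infty$ and $\eta \in \{\beta, \alpha^+\}$ define

\[ \lambda_\eta(r) = \frac{F_s(\eta(r), r)}{r - \eta(r)}, \]

$\Lambda_\eta(s) = \int_0^s \lambda_\eta(r)\,dr$ and $\Lambda^{(1)}_\eta(s) = \int_0^s r \lambda_\eta(r)\,dr$. Set $\bar{\Lambda}_\eta = \sup_{s \le \bar{x}} |\Lambda_\eta(s)|$. Define $\Phi_\eta(w, s) =
\int_0^s \lambda_\eta(r)(r - w)\,dr$; whence $\Phi_\eta(w, s) = \Lambda^{(1)}_\eta(s) - w\Lambda_\eta(s)$. Finally, define $\xi_\beta(w)$ by

\[ \xi_\beta(w) = F(w, b(w)) - \Phi_\beta(w, b(w)) \]

and $\xi_{\alpha^+}(w)$ by

\[ \xi_{\alpha^+}(w) = F(w, a(w)) - \Phi_{\alpha^+}(w, a(w)) \]

where $a(w) = w$ for $w \ge 0$ and $a(w) = a^+(w)$ for $w < 0$. Note that $\xi_\beta(w)$ (respectively $\xi_{\alpha^+}(w)$)
does not depend on the convention chosen for $b(w)$ (respectively $a^+(w)$).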



5.1 Target laws with bounded support 

In this section we suppose $\mu$ has bounded support so that $\underline{x}$ and $\bar{x}$ are finite. This assumption
will be relaxed in the next section.

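Theorem 5.3. Suppose that $\mu$ has bounded support and that F-MON$\uparrow$ holds. Then

\[ \sup_{\tau \in \mathcal{S}_{UI}(W,\mu)} E[F(W_\tau, S_\tau)] = E[F(W_{\tau^{AY}}, S_{\tau^{AY}})], \tag{5.1} \]
\[ \inf_{\tau \in \mathcal{S}_{UI}(W,\mu)} E[F(W_\tau, S_\tau)] = E[F(W_{\tau^{P}}, S_{\tau^{P}})]. \tag{5.2} \]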






Remark 5.4. In the case where $\mu$ has no atoms (so that the argmin in (3.1) is strictly increasing
and $E[X \mid X > x] = E[X \mid X \ge x]$) then we can write

\[ E[F(W_{\tau_\beta}, S_{\tau_\beta})] = \int F(w, b_\mu(w))\,\mu(dw). \tag{5.3} \]

This formula need not hold if $\mu$ has atoms.

In cases where $\mu$ has a strictly positive density $\rho$ on $(\underline{x}, \bar{x})$ and $\beta$ is differentiable, the
expression in (5.3) can be rewritten as

\[ E[F(W_{\tau_\beta}, S_{\tau_\beta})] = \int_{\mathbb{R}} F(\beta(s), s)\,P(S_{\tau_\beta} \in ds) = \int_{\mathbb{R}} F(\beta(s), s)\,\rho(\beta(s))\,\beta'(s)\,ds \tag{5.4} \]

where we use the fact that in the atom-free case

\[ \mu([\beta(s), \infty)) = P(W_{\tau_\beta} \ge \beta(s)) = P(S_{\tau_\beta} \ge s). \]
A similar remark applies to $E[F(W_{\tau^P}, S_{\tau^P})] = \int_{\mathbb{R}} F(w, a(w))\,\mu(dw)$.



Remark 5.5. The requirement that the infimum in (5.2) is taken over $\tau \in \mathcal{S}_{UI}(W, \mu)$ (and not
over all embeddings) is necessary, as can be seen by considering $F(w, s) = -(s - w)^3$. However,
if we restrict attention to functions $F$ which are increasing in $s$, then we may replace the infimum
in (5.2) with an infimum over all embeddings.



The key to the proof of the Theorem is the following lemma.

Lemma 5.6. Suppose $F$ satisfies F-MON$\uparrow$. Then, for all $w \le s$,

\[ \xi_{\alpha^+}(w) + \Phi_{\alpha^+}(w, s) \le F(w, s) \le \xi_\beta(w) + \Phi_\beta(w, s) \]
with equality on the left at $w = s$ and $w = \alpha^+(s)$ and equality on the right at $w = \beta(s)$.

Proof. For $\eta \in \{\beta, \alpha^+\}$ define

\[ L_\eta(w,s) = F(w,s) - \xi_\eta(w) - \int_0^s \lambda_\eta(r)(r - w)\,dr. \tag{5.5} \]

We will show that $L_{\alpha^+}(w, s) \ge 0$ with equality at $w = s$ and $w = \alpha^+(s)$, and $L_\beta(w, s) \le 0$
with equality at $w = \beta(s)$.

Consider the latter inequality first:

\begin{align*}
L_\beta(w,s) &= F(w,s) - \xi_\beta(w) - \int_0^s \lambda_\beta(r)(r - w)\,dr \\
&= F(w,s) - F(w,b(w)) + \int_0^{b(w)} F_s(\beta(r),r)\,\frac{r - w}{r - \beta(r)}\,dr - \int_0^{s} F_s(\beta(r),r)\,\frac{r - w}{r - \beta(r)}\,dr \\
&= \int_{b(w)}^{s} \left( \frac{F_s(w,r)}{r - w} - \frac{F_s(\beta(r),r)}{r - \beta(r)} \right)(r - w)\,dr.
\end{align*}

If $b(w) < r < s$, then since $\beta$ is increasing, $w < \beta(r)$ and by F-MON$\uparrow$ the integrand is negative.
If $s < r < b(w)$ then $w > \beta(r)$ and the integrand is positive. Thus $L_\beta(w,s) \le 0$ as required.
Clearly, there is equality at $s = b(w)$.

For $L_{\alpha^+}$ a similar calculation to the one above shows that

\[ L_{\alpha^+}(w,s) = \int_{a(w)}^{s} \left( \frac{F_s(w,r)}{r - w} - \frac{F_s(\alpha^+(r),r)}{r - \alpha^+(r)} \right)(r - w)\,dr. \]

To see that $L_{\alpha^+}(w, s) \ge 0$, consider $w \ge 0$ and $w < 0$ separately. For $w \ge 0$, $a(w) = w$ and for
$w < r < s$, $\alpha^+(r) < \alpha^+(w) < w$ so that the integrand is positive and $L_{\alpha^+}(w, s) \ge 0$. For $w < 0$,
$a(w) = a^+(w)$, and then if $a(w) < r < s$, we have $w > \alpha^+(r)$ and the integrand is positive.
Otherwise if $s < r < a(w)$, $w < \alpha^+(r)$ and the integrand is negative. In either case, allowing
for the limits on the integral, $L_{\alpha^+}(w, s) \ge 0$. Equality holds at $w = s$ and $w = \alpha^+(s)$. □



Remark 5.7. Essentially, the idea behind Lemma 5.6 and the proof of Theorem 5.3 is to interpret
the embedding property and Doob's (in)-equality for the martingale $W$ as linear constraints on
the possible joint laws of $(W_\tau, S_\tau)$, with associated Lagrange multipliers. Thus, if the joint law
is given by $\nu(dw, ds)$, then $\int_{s > r} (w - r)\,\nu(dw, ds) = 0$ (which is equivalent to (3.2) in Rogers [19]).
There is an identity of this form for each $r$ and when they are integrated against a family of
Lagrange multipliers $\lambda_\eta(r)$ we obtain

\[ 0 = \int_0^\infty dr\,\lambda_\eta(r) \int_{s > r} (w - r)\,\nu(dw,ds) = \int \nu(dw,ds) \int_{0 < r < s} \lambda_\eta(r)(w - r)\,dr. \]

The integrand of this last expression appears as the last term in (5.5).



It remains to prove Theorem 5.3. The main idea for the proof of the Theorem is that provided
that $\bar{\Lambda}_\beta$ and $\bar{\Lambda}_{\alpha^+}$ are finite then for $\tau \in \mathcal{S}_{UI}(W, \mu)$ both $(\Phi_{\alpha^+}(W^\tau_t, S^\tau_t))_{t \ge 0}$ and $(\Phi_\beta(W^\tau_t, S^\tau_t))_{t \ge 0}$
are uniformly integrable martingales. (By Ito's formula, $d\Phi_\eta(W_t, S_t) = -\Lambda_\eta(S_t)\,dW_t$ since the
finite variation term involves the product $(S_t - W_t)\,dS_t$ and when $S$ is increasing we must also
have $S_t - W_t = 0$.) It follows that $E[\Phi_\beta(W_\tau, S_\tau)] = 0$ and

\[ E[\xi_{\alpha^+}(W_\tau)] \le E[F(W_\tau, S_\tau)] \le E[\xi_\beta(W_\tau)] \]

which, given the forms of $\xi_{\alpha^+}$ and $\xi_\beta$, leads to the first result given in the introduction.

Remark 5.8. The processes $(\Phi_{\alpha^+}(W^\tau_t, S^\tau_t))_{t \ge 0}$ and $(\Phi_\beta(W^\tau_t, S^\tau_t))_{t \ge 0}$ belong to the class of
Azema-Yor martingales. A martingale $M = (M_t)_{t \ge 0}$ is an Azema-Yor martingale if $M_t =
G(S^X_t) - (S^X_t - X_t)g(S^X_t)$ for $X$ a martingale and $G' = g$, see 0.

Proof of Theorem 5.3.

Consider first the bound associated with the Azema-Yor embedding. $\bar{\Lambda}_\beta$ depends on the
combination of $\mu$ and $F$.

Suppose that $\mu$ has an atom at $\bar{x}$. By Lemma 3.2, $(r - \beta(r))^{-1}$ is integrable near zero so
that if $\mu$ has an atom at $\bar{x}$ then $r - \beta(r)$ is bounded below for $r < \bar{x}$ and $\bar{\Lambda}_\beta < \infty$. Since
$\tau \in \mathcal{S}_{UI}(W,\mu)$ implies $(W^\tau_t)_{t \ge 0}$ is bounded, and since $\Lambda_\beta(s)$ and $\Lambda^{(1)}_\beta(s)$ are bounded, we have
that $\Phi_\beta(W^\tau_t, S^\tau_t)$ is a bounded local martingale and hence $E[\Phi_\beta(W_\tau, S_\tau)] = 0$, which can be
re-expressed as $E[\Lambda^{(1)}_\beta(S_\tau)] = E[W_\tau \Lambda_\beta(S_\tau)]$. In view of Lemma 5.6 we have

\[ F(W_\tau, S_\tau) \le \xi_\beta(W_\tau) + \Phi_\beta(W_\tau, S_\tau). \tag{5.6} \]

Thus

\[ E[F(W_\tau, S_\tau)] \le \int \xi_\beta(w)\,\mu(dw). \]

Note that for $\tau = \tau_\beta$ we have equality in (5.6) and hence equality in this last expression.

Now suppose there is no atom at $\bar{x}$. Fix $\tau \in \mathcal{S}_{UI}(W, \mu)$ and let $\sigma_n = \tau \wedge H_{\bar{x} - 1/n}$ and
$\mu_n = \mathcal{L}(W_{\sigma_n})$. Then $U_{\mu_n}(x) \to U_\mu(x)$ for each $x$ and by bounded convergence we have both

\[ E[F(W_\tau, S_\tau)] = E[\lim F(W_{\sigma_n}, S_{\sigma_n})] = \lim E[F(W_{\sigma_n}, S_{\sigma_n})] \]

and

\[ E[F(W_{\tau^{AY}}, S_{\tau^{AY}})] = E[\lim F(W_{\tau^{AY}_n}, S_{\tau^{AY}_n})] = \lim E[F(W_{\tau^{AY}_n}, S_{\tau^{AY}_n})]. \]

The result follows from the previous case on comparing $\sigma_n$ with $\tau^{AY}_n$.

The proof for the lower bound is identical, except that there is no need to separate the case
where there is an atom at $\bar{x}$, since by Lemma 3.5 $(r - \alpha^+(r))^{-1}$ is integrable near zero and
hence the fact that $\mu$ has bounded support is sufficient for $\bar{\Lambda}_{\alpha^+} < \infty$.

□ 

There is a parallel pair of results based on F-MON$\downarrow$, the proofs of which are very similar.

Lemma 5.9. Suppose $F$ satisfies F-MON$\downarrow$. Then, for all $w \le s$,

\[ \xi_\beta(w) + \Phi_\beta(w,s) \le F(w,s) \le \xi_{\alpha^+}(w) + \Phi_{\alpha^+}(w,s) \]
with equality on the right at $s = w$ and $s = a(w)$ and equality on the left at $s = b(w)$.

Theorem 5.10. Suppose F-MON$\downarrow$ holds. Then

\[ \inf_{\tau \in \mathcal{S}(W,\mu)} E[F(W_\tau, S_\tau)] = E[F(W_{\tau^{AY}}, S_{\tau^{AY}})], \]
\[ \sup_{\tau \in \mathcal{S}_{UI}(W,\mu)} E[F(W_\tau, S_\tau)] = E[F(W_{\tau^{P}}, S_{\tau^{P}})]. \]

Example 5.11. Suppose $\mu = U[-1, 1]$ and $F(w, s) = (s - w)^c$ for $c > -1$ (with $c \ne 0$). Then for
$c > 2$ F-MON$\downarrow$ holds, for $0 < c < 2$ F-MON$\uparrow$ holds, and for $-1 < c < 0$, F-MON$\downarrow$ holds again.
Write $B^{AY}$ and $B^{P}$ for the bounds associated with the Azema-Yor and Perkins embeddings.



Recall the expressions for (3 and a from Examples 3.1 and\3. 
For the Azema-Yor embedding, $\beta(s) = 2s - 1$ and the law of $S_{\tau_\beta}$ is uniform on $[0, 1]$.
The associated bound (as a function of the parameter $c$) is given by

\[ B^{AY}(c) = E[F(W_{\tau^{AY}}, S_{\tau^{AY}})] = \int_{-1}^{1} (b(w) - w)^c\,\frac{dw}{2} = \int_0^1 (s - \beta(s))^c\,ds = \int_0^1 (1 - s)^c\,ds = \frac{1}{c+1}. \]

For the Perkins bound, note that for $c < 0$, $F(s,s) = \infty$, and so $B^{P}(c) = \infty$. For $c > 0$,
$F(s, s) = 0$ and using the substitution $w = \alpha^+(s) = s - 2\sqrt{s}$,

\[ B^{P}(c) = E[F(W_{\tau^{P}}, S_{\tau^{P}})] = \int_{-1}^{0} (a^+(w) - w)^c\,\frac{dw}{2} = \frac{2^c}{(c+1)(c+2)}. \]

Note that for $c = 2$, $B^{AY}(2) = B^{P}(2) = 1/3$ and all uniformly integrable embeddings for the
terminal law are consistent with the same expected payoff. The reason for this will become clear
in Section [?] and will correspond to the choice $g = 1$.
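The two bounds in Example 5.11 can be reproduced numerically. The sketch below uses a plain midpoint rule and takes $b(w) = (w+1)/2$ and $\alpha^+(s) = s - 2\sqrt{s}$ for the uniform law on $[-1,1]$, as recalled above; the function names and the step count are illustrative choices, not part of the paper.

```python
import math


def midpoint(f, a, b, steps=20000):
    """Composite midpoint rule for the integral of f over [a, b]."""
    h = (b - a) / steps
    return h * sum(f(a + (k + 0.5) * h) for k in range(steps))


def bound_ay(c):
    """Integral of (b(w) - w)^c against U[-1, 1], with b(w) = (w + 1) / 2."""
    return midpoint(lambda w: ((w + 1) / 2 - w) ** c / 2, -1.0, 1.0)


def bound_perkins(c):
    """Perkins value for c > 0, parametrised by s in (0, 1).

    With w = s - 2 sqrt(s): a^+(w) - w = 2 sqrt(s) and |dw/ds| = s^(-1/2) - 1.
    Only w < 0 contributes, because F(w, w) = 0 when c > 0.
    """
    return midpoint(lambda s: (2 * math.sqrt(s)) ** c * (s ** -0.5 - 1) / 2, 0.0, 1.0)


for c in (1.0, 2.0, 3.0):
    print(c, bound_ay(c), bound_perkins(c))
```

The ordering of the two printed columns reverses as $c$ passes through 2, where the two values coincide.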

Example 5.12. Suppose again that $\mu = U[-1, 1]$. Let $F(w,s) = (s - w)^2 / s^c$. Note that for each
$c$ either F-MON$\uparrow$ or F-MON$\downarrow$ (or both) holds, so that the Azema-Yor and Perkins embeddings
give extremal values for $E[F(W_\tau, S_\tau)]$. Consider the Azema-Yor bound as a function of the
parameter $c$ (defined for $c < 1$):

\[ B^{AY}(c) = \int_{-1}^{1} \frac{(b(w) - w)^2}{b(w)^c}\,\frac{dw}{2} = \int_0^1 \frac{(s - \beta(s))^2}{s^c}\,ds = \int_0^1 \frac{(1 - s)^2}{s^c}\,ds = \frac{2}{(1-c)(2-c)(3-c)}. \]

Figure 3: All uniformly integrable embeddings have the same expected value when $c = 2$. Note
the reversal of the bounds at $c = 2$: for $0 < c < 2$ Theorem 5.3 applies while for $c > 2$ Theorem
5.10 applies. For $c < 0$, the Perkins bound is infinite and the Azema-Yor bound is finite. The
Perkins bound as a function of $c$ is discontinuous at $c = 0$.

For the Perkins bound we have (for $c < 3/2$)

\[ B^{P}(c) = \int_{-1}^{0} \frac{(a^+(w) - w)^2}{a^+(w)^c}\,\frac{dw}{2} = \int_0^1 2 s^{1-c}\left(s^{-1/2} - 1\right) ds = \frac{1}{(3/2 - c)(2 - c)}. \]




Figure 4: For 1 < c < 3/2 the Azema-Yor upper bound is infinite while the Perkins lower bound 
is finite. 



6 General centred target measures 



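Theorem 6.1. Fix $\tau \in \mathcal{S}_{UI}(W, \mu)$. Suppose, in addition to Assumption 5.1, that $F \ge 0$, that

\[ E\left[F(W_{H_{\pm n}}, S_{H_{\pm n}}); \tau > H_{\pm n}\right] \to 0, \tag{6.1} \]

and that if $(\mu_n)_{n \ge 1}$ is any sequence of measures which is increasing in convex order for which
$\mu_n \Rightarrow \mu$, $U_{\mu_n}(0) \to U_\mu(0)$ and $\mu_n(\{0\}) \to \mu(\{0\})$ then both

\[ E\left[F(W_{\tau^{AY}_{\mu_n}}, S_{\tau^{AY}_{\mu_n}})\right] \to E\left[F(W_{\tau^{AY}_{\mu}}, S_{\tau^{AY}_{\mu}})\right] \tag{6.2} \]

and

\[ E\left[F(W_{\tau^{P}_{\mu_n}}, S_{\tau^{P}_{\mu_n}})\right] \to E\left[F(W_{\tau^{P}_{\mu}}, S_{\tau^{P}_{\mu}})\right]. \tag{6.3} \]

Then, if F-MON$\uparrow$ holds

\[ E\left[F(W_{\tau^{P}}, S_{\tau^{P}})\right] \le E\left[F(W_\tau, S_\tau)\right] \le E\left[F(W_{\tau^{AY}}, S_{\tau^{AY}})\right], \]

whereas, if F-MON$\downarrow$ holds then

\[ E\left[F(W_{\tau^{AY}}, S_{\tau^{AY}})\right] \le E\left[F(W_\tau, S_\tau)\right] \le E\left[F(W_{\tau^{P}}, S_{\tau^{P}})\right]. \]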



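Proof. Suppose F-MON$\uparrow$ holds (the proof for F-MON$\downarrow$ is similar). Given $\tau \in \mathcal{S}_{UI}(W,\mu)$, let
$\sigma_n = \tau \wedge H_{\pm n}$, $\mu_n = \mathcal{L}(W_{\sigma_n})$ and define $\tau^{AY}_n$ and $\tau^{P}_n$ to be the Azema-Yor and Perkins stopping
times associated with $\mu_n$.

We have, using monotone convergence, (6.1), Theorem 5.3 and finally (6.2),

\begin{align*}
E[F(W_\tau, S_\tau)] &= E\left[\lim F(W_{\sigma_n}, S_{\sigma_n}); \sigma_n = \tau < H_{\pm n}\right] \\
&= \lim E\left[F(W_{\sigma_n}, S_{\sigma_n}) I_{\{\tau < H_{\pm n}\}}\right] \\
&= \lim E\left[F(W_{\sigma_n}, S_{\sigma_n}) I_{\{\tau < H_{\pm n}\}} + F(W_{H_{\pm n}}, S_{H_{\pm n}}) I_{\{\tau \ge H_{\pm n}\}}\right] \\
&= \lim E\left[F(W_{\sigma_n}, S_{\sigma_n})\right] \\
&\le \lim E\left[F(W_{\tau^{AY}_n}, S_{\tau^{AY}_n})\right] = E\left[F(W_{\tau^{AY}}, S_{\tau^{AY}})\right].
\end{align*}

Similarly

\[ \lim E\left[F(W_{\sigma_n}, S_{\sigma_n})\right] \ge \lim E\left[F(W_{\tau^{P}_n}, S_{\tau^{P}_n})\right] = E\left[F(W_{\tau^{P}}, S_{\tau^{P}})\right]. \]

□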



Corollary 6.2. Suppose that $F(w, s) \le A(1 + |w|^k + s^k)$ for $k \ge 1$ and that $\mu$ has finite $k + \epsilon$
moment, for some positive $\epsilon$. Then the hypotheses (6.1), (6.2) and (6.3) are all satisfied, and
the conclusions of Theorem 6.1 hold.

Proof. By Doob's submartingale inequality for $(|W_{t \wedge \tau}|^{k+\epsilon})_{t \ge 0}$, for any $\tau \in \mathcal{S}_{UI}(W, \mu)$

\[ m^{k+\epsilon} P(\tau > H_{\pm m}) \le E[|W_\tau|^{k+\epsilon}] < \infty. \]

Then, since $P(\tau > H_{\pm n})$ decays at least as fast as $n^{-(k+\epsilon)}$,

\[ E[F(W_{H_{\pm n}}, S_{H_{\pm n}}); \tau > H_{\pm n}] \le A(1 + 2n^k) P(\tau > H_{\pm n}) \to 0. \]

For (6.2) we have that $\tau_{\beta_n} \to \tau_\beta$ almost surely. Moreover, since $\mu_n \le_{cx} \mu$ there exists a
stopping time ($\rho_n$ say) with $\rho_n \ge \tau_{\beta_n}$ and $\rho_n \in \mathcal{S}_{UI}(W,\mu)$. For such a $\rho_n$, $E[|W_{\rho_n}|^{k+\epsilon}] =
\int_{\mathbb{R}} |x|^{k+\epsilon}\,\mu(dx) < \infty$ by hypothesis, and then (defining $W^*_t = \sup_{s \le t} |W_s|$) by Doob's $L^p$
inequality $E[(W^*_{\rho_n})^{k+\epsilon}] \le D < \infty$ for some constant $D$, independent of $n$.

Set $F_n = F(W_{\tau^{AY}_n}, S_{\tau^{AY}_n})$ and $F = F(W_{\tau^{AY}}, S_{\tau^{AY}})$, then $F_n \to F$ almost surely. The goal is
to show that $E[F_n] \to E[F]$ which will follow if $\sup_n E[(F_n)^p] < \infty$ for some $p > 1$, for then $(F_n)_{n \ge 1}$ is uniformly
integrable. We have that if $|w| \le x$ and $s \le x$ then with $p = 1 + \epsilon/k$ (so that $kp = k + \epsilon$),

\[ F(w, s)^p \le A^p (1 + 2x^k)^p \le A^p 3^p (1 + x^{kp}). \]

Hence

\[ E[F_n^p] \le A^p 3^p \left(1 + E[(W^*_{\tau_{\beta_n}})^{kp}]\right) \le A^p 3^p \left(1 + E[(W^*_{\rho_n})^{kp}]\right) \le A^p 3^p (1 + D) < \infty. \]

For (6.3), consider a subsequence $n(k)$. Then down a further subsequence $\tau^{P}_n \to \tau^{P}$ almost
surely and down this subsequence (6.3) holds by identical arguments as in the case for the
Azema-Yor embedding. Hence (6.3) holds. □



7 Objective functions as running costs 

Our original aim in studying functions $F(w, s)$ was as an aid in the analysis of the expected
values of integrals of the form $\int_0^\tau g(S_t)\,dt$. Motivated by the variance swap problem in mathematical
finance we asked:

Given $g$ and $\mu$, what is the range of possible values of $E[\int_0^\tau g(S_u)\,du]$ over embeddings
$\tau$ of $\mu$ in Brownian motion?

Our aim is to reduce this problem to the case previously considered, but to use the extra
structure to prove more powerful results under weaker hypotheses.

The expected value of $\int_0^\tau g(S_u)\,du$ is intimately related to the value of $E[G(W_\tau, S_\tau)]$ where
$G(w, s) = (s - w)^2 g(s)$. Indeed, if $g$ is continuously differentiable, then by Ito's Lemma,

\[ G(W_\tau, S_\tau) = G(0,0) + \int_0^\tau g(S_u)\,du - \int_0^\tau 2(S_u - W_u)g(S_u)\,dW_u, \tag{7.1} \]

so that if $g(0)$ is finite (and then $G(0,0) = 0$), and if $(\int_0^{t \wedge \tau} 2(S_u - W_u)g(S_u)\,dW_u)_{t \ge 0}$ is a
uniformly integrable martingale, then $E[\int_0^\tau g(S_u)\,du] = E[G(W_\tau, S_\tau)]$.
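For completeness, the Ito computation behind (7.1) can be written out as follows. This is a sketch under the stated assumption that $g$ is continuously differentiable; the subscripts on $G$ denote partial derivatives.

```latex
\begin{align*}
G_w(w,s) &= -2(s-w)\,g(s), \qquad
G_{ww}(w,s) = 2\,g(s), \qquad
G_s(w,s) = 2(s-w)\,g(s) + (s-w)^2 g'(s), \\
dG(W_u, S_u) &= G_w(W_u,S_u)\,dW_u + \tfrac{1}{2}\,G_{ww}(W_u,S_u)\,du + G_s(W_u,S_u)\,dS_u \\
             &= -2(S_u - W_u)\,g(S_u)\,dW_u + g(S_u)\,du .
\end{align*}
```

The $dS_u$ term vanishes because $S$ only increases on the set $\{S_u = W_u\}$, where $G_s = 0$. Integrating from 0 to $\tau$ gives the identity in (7.1).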

If $g$ is increasing (respectively decreasing) then $G$ satisfies G-MON$\downarrow$ (respectively G-MON$\uparrow$)
and we can apply the results of previous sections to deduce that the Azema-Yor and Perkins
solutions give bounds on $E[\int_0^\tau g(S_u)\,du]$ over embeddings $\tau$ of $\mu$.

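Theorem 7.1. Suppose $g : \mathbb{R}_+ \to \mathbb{R}_+$ is a positive function and that $\mu$ is centred.

(i) Suppose $g$ is increasing. Then,

\[ \inf_{\tau \in \mathcal{S}(W,\mu)} E\left[\int_0^\tau g(S_u)\,du\right] = E\left[\int_0^{\tau^{AY}} g(S_u)\,du\right] \]

and

\[ \sup_{\tau \in \mathcal{S}_{UI}(W,\mu)} E\left[\int_0^\tau g(S_u)\,du\right] = E\left[\int_0^{\tau^{P}} g(S_u)\,du\right]. \]

(ii) Suppose $g$ is decreasing. Then,

\[ \inf_{\tau \in \mathcal{S}(W,\mu)} E\left[\int_0^\tau g(S_u)\,du\right] = E\left[\int_0^{\tau^{P}} g(S_u)\,du\right] \]

and

\[ \sup_{\tau \in \mathcal{S}_{UI}(W,\mu)} E\left[\int_0^\tau g(S_u)\,du\right] = E\left[\int_0^{\tau^{AY}} g(S_u)\,du\right]. \]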

Remark 7.2. As we remarked in the introduction, at first sight this result is counterintuitive.
Given increasing $g$, the Azema-Yor stopping time maximises $E[g(S_\tau)]$ over $\tau \in \mathcal{S}_{UI}(W, \mu)$, and
it seems plausible that it might also maximise $E[\int_0^\tau g(S_u)\,du]$. In fact the exact opposite is
true. The explanation is that for the Azema-Yor embedding there is co-monotonicity between
$S_\tau$ and $W_\tau$, and conditional on $S_\tau \ge s$, the stopping time occurs quite soon (and certainly
before $W$ drops below $\beta(s)$), whereas for the Perkins embedding, conditional on $S_\tau \ge s$, there
are paths which will only be stopped when $W$ goes below $\alpha^+(s)$. Thus, for increasing $g$ when
we wish to maximise the time (before $\tau$) for which $S$ is large, this is best achieved by the
Perkins embedding: although relatively few paths will have large $S$ (most will have already
been stopped) those with a large maximum will spend a long time after first hitting $s$ before
being stopped.



Example 7.3. Recall Example 5.12. Suppose $\mu = U[-1,1]$ and $g(s) = s^{-c}$. Then, for $c < 0$,
\[ \frac{2}{(1-c)(2-c)(3-c)} \le E\left[\int_0^\tau S_u^{-c}\,du\right] \le \frac{1}{(2-c)(3/2-c)}. \]

For $0 < c < 1$,
\[ \frac{1}{(2-c)(3/2-c)} \le E\left[\int_0^\tau S_u^{-c}\,du\right] \le \frac{2}{(1-c)(2-c)(3-c)}, \]
for $1 \le c < 3/2$, $(2-c)^{-1}(3/2-c)^{-1} \le E[\int_0^\tau S_u^{-c}\,du] \le \infty$, and for $c \ge 3/2$, $E[\int_0^\tau S_u^{-c}\,du] = \infty$ for all
embeddings $\tau$.

Note that for $c = 0$, $E[\tau]$ is independent of $\tau$ and equal to the variance of $\mu$; both bounds then equal $1/3$.
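The two extremal values in Example 7.3 can be evaluated numerically from the integral representations in Example 5.12. The sketch below assumes $\mu = U[-1,1]$, $\beta(s) = 2s - 1$ and $\alpha^+(s) = s - 2\sqrt{s}$, and substitutes $s = t^2$ so that the integrands stay bounded for $c \le 1/2$; the function names are illustrative.

```python
def midpoint(f, a, b, steps=20000):
    """Composite midpoint rule for the integral of f over [a, b]."""
    h = (b - a) / steps
    return h * sum(f(a + (k + 0.5) * h) for k in range(steps))


def ay_value(c):
    """Azema-Yor value: integral of (1 - s)^2 s^(-c) over (0, 1), with s = t^2."""
    return midpoint(lambda t: 2 * (1 - t * t) ** 2 * t ** (1 - 2 * c), 0.0, 1.0)


def perkins_value(c):
    """Perkins value: integral of 2 s^(1-c) (s^(-1/2) - 1) over (0, 1), with s = t^2."""
    return midpoint(lambda t: 4 * (t ** (2 - 2 * c) - t ** (3 - 2 * c)), 0.0, 1.0)


for c in (-1.0, 0.0, 0.5):
    print(c, ay_value(c), perkins_value(c))
```

For negative $c$ (increasing $g$) the Azema-Yor value is the smaller of the two, for $0 < c < 1$ it is the larger, and at $c = 0$ both reduce to the variance of $\mu$.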

Example 7.4. Recall the calculations from Example 3.4. Let the target law $\mu$ with support
$[-1, \infty)$ satisfy $\mu(dx) = \frac{2}{(x+2)^3}\,dx$. Let $g(s) = \frac{1}{c + s}$ for $c > 0$, which is decreasing in $s$.
The Azema-Yor upper bound can be calculated explicitly to be

\[ \int_{-1}^{\infty} \frac{(b(w) - w)^2}{b(w) + c}\,\frac{2}{(w+2)^3}\,dw = \frac{2(\log(c) - \log(2))}{c - 2}. \]

The expression for the Perkins lower bound is given by

\[ \int_{-1}^{0} \frac{(a^+(w) - w)^2}{a^+(w) + c}\,\frac{2}{(w+2)^3}\,dw. \]

The expression for $\alpha^+$ is too complicated for the expression above to have an analytic
representation. However, the values can be computed numerically for different $c$.
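The closed form $2(\log c - \log 2)/(c - 2)$ for the Azema-Yor bound in Example 7.4 can be checked numerically. The sketch below rests on two assumptions that should be compared with Example 3.4: that the target density is $2/(x+2)^3$ on $[-1,\infty)$, and that the corresponding barycentre is $b(w) = 2(w+1)$ (which follows from that density). The half-line is mapped to $(0,1)$ by the illustrative substitution $w = -1 + t/(1-t)$.

```python
import math


def midpoint(f, a, b, steps=20000):
    """Composite midpoint rule for the integral of f over [a, b]."""
    h = (b - a) / steps
    return h * sum(f(a + (k + 0.5) * h) for k in range(steps))


def ay_bound_numeric(c):
    """Integrate (b(w)-w)^2 / (b(w)+c) against the density 2/(w+2)^3 on [-1, inf)."""
    def integrand(t):
        w = -1.0 + t / (1.0 - t)          # maps (0, 1) onto (-1, infinity)
        jacobian = 1.0 / (1.0 - t) ** 2   # dw/dt
        b = 2.0 * (w + 1.0)               # assumed barycentre of the target law
        density = 2.0 / (w + 2.0) ** 3
        return (b - w) ** 2 / (b + c) * density * jacobian
    return midpoint(integrand, 0.0, 1.0)


def ay_bound_closed(c):
    """Closed form quoted in Example 7.4 (valid for c > 0, c != 2)."""
    return 2.0 * (math.log(c) - math.log(2.0)) / (c - 2.0)


for c in (0.5, 1.0, 4.0):
    print(c, ay_bound_numeric(c), ay_bound_closed(c))
```

The same quadrature routine could be applied to the Perkins bound once $a^+$ has been tabulated numerically.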



The rest of this section is devoted to a proof of Theorem 7.1. We split the proof into four
separate parts.

Proof of Theorem 7.1 (i): Lower bound

Suppose first that $g$ is monotonic increasing and that we are interested in minimising the
quantity $E[\int_0^\tau g(S_u)\,du]$ over embeddings $\tau$ of $\mu$ in $W$. Note that it is sufficient to restrict
attention to $\mathcal{S}_{UI}(W, \mu)$: for non-minimal $\tau \in \mathcal{S}(W,\mu)$ there exists $\tilde\tau \le \tau$ with $\tilde\tau \in \mathcal{S}_{UI}(W, \mu)$,
and then $\int_0^\tau g(S_u)\,du \ge \int_0^{\tilde\tau} g(S_u)\,du$ for each $\omega \in \Omega$.

Suppose temporarily that $g$ is bounded and continuously differentiable. Later we will relax
this assumption. Then $G(w,s) = (s - w)^2 g(s)$ satisfies G-MON$\downarrow$.

For $\tau \in \mathcal{S}_{UI}(W,\mu)$ let $\sigma_n = \tau \wedge H_{\pm n}$, let $\mu_n = \mathcal{L}(W_{\sigma_n})$, $\beta_n$ be the inverse barycentre of
$\mu_n$ and finally let $\tau^{AY}_n$ be the Azema-Yor stopping rule associated with the law $\mu_n$ so that
$\tau^{AY}_n = \tau_{\beta_n} = \inf\{u : W_u \le \beta_n(S_u)\}$. Then, by Proposition 4.4, since $U_{\mu_n} \uparrow U_\mu$, $\tau_{\beta_n} \to \tau_\beta$ almost
surely.

If a stopping time $\rho$ is such that $\rho \le H_{\pm n}$ then $E[\rho] < \infty$ and for $u \le \rho$, $(S_u - W_u)g(S_u)$
is bounded. Then, if $M_t = \int_0^t (S_u - W_u)g(S_u)\,dW_u$ we have that $(M^\rho_t)_{t \ge 0}$ is an $L^2$ bounded
martingale for which

\[ E[(M_\rho)^2] = E\left[\int_0^\rho (S_u - W_u)^2 g(S_u)^2\,du\right]. \tag{7.2} \]

It follows that

\begin{align*}
E\left[\int_0^{\sigma_n} g(S_u)\,du\right] &= E[(S_{\sigma_n} - W_{\sigma_n})^2 g(S_{\sigma_n})] \\
&\ge E[(S_{\tau_{\beta_n}} - W_{\tau_{\beta_n}})^2 g(S_{\tau_{\beta_n}})] \\
&= E\left[\int_0^{\tau_{\beta_n}} g(S_u)\,du\right]
\end{align*}

where we have used (7.1) and (7.2) twice and Theorem 5.10. Then it follows from the Fatou
Lemma that

\[ \lim E\left[\int_0^{\sigma_n} g(S_u)\,du\right] \ge \lim E\left[\int_0^{\tau_{\beta_n}} g(S_u)\,du\right] \ge E\left[\liminf \int_0^{\tau_{\beta_n}} g(S_u)\,du\right] \tag{7.3} \]

and by monotone convergence and the fact that $\tau_{\beta_n} \to \tau_\beta$ almost surely, $E[\int_0^\tau g(S_u)\,du] \ge
E[\int_0^{\tau_\beta} g(S_u)\,du]$ as required.

Finally we remove the temporary assumptions on $g$. Given $g$ is monotonic increasing we
can find an increasing sequence of bounded continuously differentiable (increasing) functions
$g_m$ which approximate $g$ from below. Then, by monotone convergence

\[ E\left[\int_0^\tau g(S_u)\,du\right] = \lim_m E\left[\int_0^\tau g_m(S_u)\,du\right] \ge \lim_m E\left[\int_0^{\tau_\beta} g_m(S_u)\,du\right] = E\left[\int_0^{\tau_\beta} g(S_u)\,du\right]. \]

Note that this same argument will apply in all four parts of Theorem 7.1 and henceforth
without loss of generality we will assume that $g$ is continuously differentiable and bounded by
$\bar{g}$.

Proof of Theorem 7.1 (ii): Lower bound

Case 1: There exists an open interval $I \subset [-1, 1]$ containing 0 with $\mu(I) = 0$.

Given $\tau \in \mathcal{S}(W,\mu)$, let $\sigma_m = \tau \wedge H_{\pm m}$. Let $\mu_m = \mathcal{L}(W_{\sigma_m})$. Write $\tau^{P}_m$ for the Perkins
embedding of $\mu_m$. Note that $\mu_m \Rightarrow \mu$, $U_{\mu_m}(0) \to U_\mu(0)$ and $\mu_m(I) = 0$. Then, $\tau^{P}_m = \tau_{\alpha_m}$
and by Proposition 4.6 a), $\tau_{\alpha_m} \to \tau_\alpha$ almost surely. Then exactly as in (7.3), but now using
Theorem 5.3 to give that the lower bound is attained by the Perkins embedding, we conclude
that $E[\int_0^\tau g(S_u)\,du] \ge E[\int_0^{\tau^{P}} g(S_u)\,du]$.

Case 2: General $\mu$.

Given any subsequence, by Proposition 4.6 b) we may take a further subsequence down which
$\tau^{P}_m \to \tau_\alpha$ almost surely. Then down this subsequence the result holds, as in Case 1. Since the
first subsequence was arbitrary we are done.
Proof of Theorem 7.1 (ii): Upper bound

Now consider the upper bound in Theorem 7.1 (ii). Rather than attempting to find a dominating
random variable which will allow us to use the Reverse Fatou Lemma in place of the
Fatou Lemma above we will use a slightly different approach based on defining a sequence of
intermediate stopping times.

Let $\tau$ be any element of $\mathcal{S}_{UI}(W,\mu)$. Suppose $g$ is bounded, continuously differentiable and
monotonic decreasing, and that $\mu$ has support in a bounded interval. Then, as above,
$E[\int_0^\tau g(S_u)\,du] = E[G(W_\tau, S_\tau)]$. Moreover, we can conclude from Theorem 5.3 that

\[ \sup_{\tau \in \mathcal{S}_{UI}(W,\mu)} E\left[\int_0^\tau g(S_u)\,du\right] = E\left[\int_0^{\tau^{AY}} g(S_u)\,du\right]. \]

It remains to remove the assumptions on $\mu$.

Given $\epsilon$, let $U^\epsilon(x) = \max\{U_\mu(x) - \epsilon, |x|\}$ and let $\underline{x}_\epsilon$ and $\bar{x}_\epsilon$ be the left and right-hand
endpoints of the interval $I_\epsilon = \{x : U^\epsilon(x) > |x|\}$.

Let $\sigma_\epsilon = \tau \wedge \inf\{u : W_u \notin I_\epsilon\}$. Let $\mu_\epsilon$ be the law of $W_{\sigma_\epsilon}$ and let $U_\epsilon$ be the associated
potential. Then $U_\epsilon = U^\epsilon$ on $I_\epsilon^c$ and $U^\epsilon \le U_\epsilon \le U_\mu$.

Now let $\tilde{U}_\epsilon$ be the largest convex function such that $\tilde{U}_\epsilon(x) = |x|$ on $I_\epsilon^c$ and $\tilde{U}_\epsilon \le U_\mu$. It
follows that $\tilde{U}_\epsilon$ is actually equal to $U_\mu$ on an interval $\tilde{I}_\epsilon = [c_\epsilon, d_\epsilon]$. If $\epsilon$ is small enough then
$0 \in \tilde{I}_\epsilon$. See Figure 5. Further, $U^\epsilon \le U_\epsilon \le \tilde{U}_\epsilon \le U_\mu$ and in terms of the associated measures
$\mu^\epsilon \le_{cx} \mu_\epsilon \le_{cx} \tilde\mu_\epsilon \le_{cx} \mu$, where $\tilde\mu_\epsilon$ is such that $U_{\tilde\mu_\epsilon} = \tilde{U}_\epsilon$ and we recall that $\le_{cx}$ denotes 'less
than or equal to in convex order'. Then, by a theorem of Strassen [21] (or for a more explicit
construction in our context, Chacon and Walsh [5]), given $\sigma_\epsilon$ there exists a stopping time $\tilde\sigma_\epsilon$
such that $\sigma_\epsilon \le \tilde\sigma_\epsilon$ almost surely, and $\tilde\mu_\epsilon = \mathcal{L}(W_{\tilde\sigma_\epsilon})$.

Now consider a sequence $\epsilon_n$ decreasing to zero. Let $\tilde\beta_{\epsilon_n}$ be the inverse barycentre associated
with $\tilde\mu_{\epsilon_n}$ and let $\tilde\tau_n$ be the Azema-Yor stopping time associated with $\tilde\beta_{\epsilon_n}$. The introduction
of the stopping times $\tilde\sigma_{\epsilon_n}$ gives extra structure which means that not only do the barycentres
converge (as in Proposition 4.4), but also that they converge monotonically.




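Figure 5: The potentials $\tilde{U}_\epsilon$ increase monotonically as $\epsilon$ decreases. Moreover, over a range of $x$,
depending on $\epsilon_n$, we have $\tilde\beta_{\epsilon_n}(x) = \beta(x)$, and hence, the inverse barycentre functions converge
monotonically.

Lemma 7.5. $\tilde\beta_n \downarrow \beta$ and $\tilde\tau_n \uparrow \tau_\beta$ almost surely.

Proof: Write $\underline{x}_n$ (respectively $\bar{x}_n, c_n, d_n$) for $\underline{x}_{\epsilon_n}$ (respectively $\bar{x}_{\epsilon_n}, c_{\epsilon_n}, d_{\epsilon_n}$).

Then, for $s \le b(c_n)$, $\tilde\beta_n(s) = \underline{x}_n \ge \beta(s)$, for $b(c_n) < s < \bar{x}_n$, $\tilde\beta_n(s) = \beta(s)$ and for $s \ge \bar{x}_n$,
$\tilde\beta_n(s) = s \ge \beta(s)$.

Monotonicity in $n$ of $\tilde\tau_n$ follows immediately. □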
It follows from the results for bounded target distributions that

\[ E\left[\int_0^{\sigma_{\epsilon_n}} g(S_u)\,du\right] \le E\left[\int_0^{\tilde\sigma_{\epsilon_n}} g(S_u)\,du\right] = E\left[G(W_{\tilde\sigma_{\epsilon_n}}, S_{\tilde\sigma_{\epsilon_n}})\right] \le E\left[G(W_{\tilde\tau_n}, S_{\tilde\tau_n})\right] = E\left[\int_0^{\tilde\tau_n} g(S_u)\,du\right]. \]

We have that the integral inside the first expectation converges monotonically to $\int_0^\tau g(S_u)\,du$,
whereas the integral inside the final expression converges monotonically to $\int_0^{\tau_\beta} g(S_u)\,du$. Hence
$E[\int_0^\tau g(S_u)\,du] \le E[\int_0^{\tau_\beta} g(S_u)\,du]$ as required.

Proof of Theorem 7.1 (i): Upper bound

The final element of Theorem 7.1 is the upper bound in the case of monotonically increasing $g$.
Recall that we suppose that $g$ is continuously differentiable, and bounded by $\bar{g}$.

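If $\mu$ has bounded support then Theorem 5.10 applies directly, so we assume that the support
of $\mu$ is unbounded.

If $\mu \notin L^2$ then for each $\tau \in \mathcal{S}(W, \mu)$ we have $E[\tau] = \infty$ and using the fact that $E[H_\epsilon \wedge \tau^{P}] \le
E[H_\epsilon \wedge H_{\alpha^+(\epsilon)}] < \infty$ we have that $E[\int_0^{\tau^{P}} g(S_u)\,du] \ge g(\epsilon) E[\int_{H_\epsilon \wedge \tau^{P}}^{\tau^{P}} du] = \infty$, and there is nothing
to prove.

So suppose $\mu \in L^2$. Then the area between the curves $U_\mu(x)$ and $|x|$ is finite.
Let $U^\epsilon(x) = \max\{U_\mu(x) - \epsilon, |x|\}$ and related quantities be defined as above.
This time, since $\tilde{U}_\epsilon = U_\mu$ on $\tilde{I}_\epsilon$ we have that $\alpha_{\tilde\mu_\epsilon} = \alpha_\mu$ on some sub-interval $\hat{I}_\epsilon \subset \tilde{I}_\epsilon$ of the
form $\hat{I}_\epsilon = [\hat{c}_\epsilon, \hat{d}_\epsilon]$, and as $\epsilon \downarrow 0$, $\hat{I}_\epsilon$ increases to the support of $\mu$.
Now

\[ E\left[\int_0^{\tau} g(S_u)\,du\right] = \lim E\left[\int_0^{\sigma_\epsilon} g(S_u)\,du\right] \]

and

\[ E\left[\int_0^{\sigma_\epsilon} g(S_u)\,du\right] \le E\left[\int_0^{\tilde\sigma_\epsilon} g(S_u)\,du\right] \le E\left[\int_0^{\tau^{P}(\tilde\mu_\epsilon)} g(S_u)\,du\right]. \]

But

\[ E\left[\int_0^{\tau^{P}(\tilde\mu_\epsilon)} g(S_u)\,du\right] = E\left[\int_0^{\tau^{P}(\tilde\mu_\epsilon) \wedge H_{\hat{c}_\epsilon} \wedge H_{\hat{d}_\epsilon}} g(S_u)\,du\right] + E\left[\int_{\tau^{P}(\tilde\mu_\epsilon) \wedge H_{\hat{c}_\epsilon} \wedge H_{\hat{d}_\epsilon}}^{\tau^{P}(\tilde\mu_\epsilon)} g(S_u)\,du\right]. \]

Since $\alpha_{\tilde\mu_\epsilon} = \alpha_\mu$ on $\hat{I}_\epsilon$ we have that $\tau^{P}(\tilde\mu_\epsilon) \wedge H_{\hat{c}_\epsilon} \wedge H_{\hat{d}_\epsilon}$ is monotonically increasing as $\epsilon \downarrow 0$
and hence the first term on the right-hand side converges to $E[\int_0^{\tau^{P}(\mu)} g(S_u)\,du]$. Meanwhile,
the second term is bounded by $\bar{g}\,E[\tau^{P}(\tilde\mu_\epsilon) - \tau^{P}(\tilde\mu_\epsilon) \wedge H_{\hat{c}_\epsilon} \wedge H_{\hat{d}_\epsilon}]$. This last quantity is at most
$\bar{g}$ multiplied by the area between the potentials $U_{\tilde\mu_\epsilon}$ and $U_{\hat\mu_\epsilon}$ where $\hat\mu_\epsilon = \mathcal{L}(W_{\tau^{P}(\tilde\mu_\epsilon) \wedge H_{\hat{c}_\epsilon} \wedge H_{\hat{d}_\epsilon}})$.
However, as $\epsilon$ tends to zero this area tends to zero. Hence $E[\int_0^\tau g(S_u)\,du] \le E[\int_0^{\tau^{P}} g(S_u)\,du]$.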



8 An application and extensions 



8.1 Variance swap on the sum of squared returns 

We now return to the question which originally motivated this paper which was to find model- 
independent bounds for variance swaps given the terminal law of the underlying asset price 
process or equivalently, call prices with expiry T for all strikes. Using the results developed in 
this article we will show how to bound the idealised variance swap based on squared returns, 
introduced in Section 2. The results in this article motivated further work on model-independent
bounds and hedging strategies for variance swaps in a general setting, see Hobson and Klimmek 

m]. 

As in Section 2, let $X = (X_t)_{0 \le t \le T}$ be a square-integrable martingale started at $X_0 = x_0$
with $X_T \sim \mu$, where $\mu$ is centred at $x_0$ and supported on $\mathbb{R}_+$. Recall from (2.1) the definition
for the payoff of an idealised variance swap $V_T = V((X_s)_{0 \le s \le T}) = \int_0^T (X_{t-})^{-2}\,d[X, X]_t$. By
(2.4) and (2.5) we have



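\[ \inf_{\tau \in \mathcal{S}_{UI}(W,\mu)} E\left[\int_0^\tau \frac{du}{(S_u)^2}\right] \le E[V_T] \le \sup_{\tau \in \mathcal{S}_{UI}(W,\mu)} E\left[\int_0^\tau \frac{du}{(I_u)^2}\right]. \]

Let $\check\mu$ be the measure $\mu$ reflected around 0, so that $\check\mu$ is a measure on $\mathbb{R}_-$, and observe that

\[ \sup_{\tau \in \mathcal{S}_{UI}(W,\mu)} E\left[\int_0^\tau \frac{du}{(I_u)^2}\right] = \sup_{\tau \in \mathcal{S}_{UI}(B,\check\mu)} E\left[\int_0^\tau \frac{du}{(S^B_u)^2}\right] \]

where $B$ is a Brownian motion started at $-x_0$, with maximum process $S^B$. Now we apply
Theorem 7.1 to see that

\[ E\left[\int_0^{\tau^{P}_\mu} \frac{du}{(S_u)^2}\right] \le E[V_T] \le E\left[\int_0^{\tau^{P}_{\check\mu}} \frac{du}{(S^B_u)^2}\right]. \]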



Note that the Perkins embedding for is determined by the monotonic functions a~ where 

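Example 8.1. Suppose that $X_0 = 1$ and $\mu = U[0, 2]$. Shifting the quantities calculated in
Example 3.1 to allow for the starting value $X_0 = 1$ it is clear that $\alpha^+ : [1, 2] \to [0, 1]$ is defined
by $\alpha^+(s) = s - 2\sqrt{s - 1}$ and $\alpha^- : [0, 1] \to [1, 2]$ is defined by $\alpha^-(i) = i + 2\sqrt{1 - i}$. Hence the lower
bound can be calculated:

\[ E\left[\int_0^{\tau^{P}} \frac{du}{S_u^2}\right] = E\left[\left(1 - \frac{W_{\tau^{P}}}{S_{\tau^{P}}}\right)^2\right] = \int_0^1 \left(1 - \frac{x}{a^+(x)}\right)^2 \frac{dx}{2} = \frac{\pi}{2} - 2\log 2. \]

For the upper bound, first considering $g_\epsilon(s) = s^{-2} \wedge \epsilon^{-2}$ and then letting $\epsilon \downarrow 0$,

\[ E\left[\int_0^{\tau^{P}} \frac{du}{I_u^2}\right] = E\left[\left(1 - \frac{W_{\tau^{P}}}{I_{\tau^{P}}}\right)^2\right] = \int_1^2 \left(1 - \frac{x}{a^-(x)}\right)^2 \frac{dx}{2} = \infty. \]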
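The lower bound in Example 8.1 can be confirmed numerically. The sketch below assumes the reading that the bound equals $\int_0^1 (1 - x/a^+(x))^2\,dx/2$ with $a^+$ the inverse of $\alpha^+(s) = s - 2\sqrt{s-1}$ on $[1,2]$, and parametrises the integral by $s$ to avoid inverting $\alpha^+$; the function names are illustrative.

```python
import math


def midpoint(f, a, b, steps=20000):
    """Composite midpoint rule for the integral of f over [a, b]."""
    h = (b - a) / steps
    return h * sum(f(a + (k + 0.5) * h) for k in range(steps))


def lower_bound_numeric():
    """Integrate (1 - x / a^+(x))^2 dx / 2 over x in (0, 1), using x = alpha^+(s).

    With x = s - 2 sqrt(s - 1): s - x = 2 sqrt(s - 1), |dx/ds| = (s - 1)^(-1/2) - 1,
    and s runs over (1, 2).
    """
    def integrand(s):
        ratio = 2.0 * math.sqrt(s - 1.0) / s      # equals 1 - x / a^+(x)
        return ratio ** 2 * ((s - 1.0) ** -0.5 - 1.0) / 2.0
    return midpoint(integrand, 1.0, 2.0)


print(lower_bound_numeric())
print(math.pi / 2 - 2 * math.log(2))
```

The corresponding integral for the upper bound diverges logarithmically near $x = 2$, where $a^-(x)$ tends to zero, which is consistent with the infinite value stated in the example.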



8.2 Extension to diffusions 

Suppose that $(X_t)_{t \ge 0}$ is a time-homogeneous diffusion on $I \subset \mathbb{R}$. More specifically, let $\sigma : I \to
(0, \infty)$ and $b : I \to \mathbb{R}$ be Lipschitz functions and define $(X_t)_{t \ge 0}$ to be the solution to

\[ dX_t = \sigma(X_t)\,dB_t + b(X_t)\,dt, \qquad X_0 = x_0, \]

where $(B_t)_{t \ge 0}$ is a Brownian motion.

Let $s : I \to \mathbb{R}$ be the strictly increasing and $C^2$ scale function of $X$,

\[ s(x_0) = 0, \qquad s'(x) = \exp\left(-\int_{x_0}^{x} \frac{2b(u)}{\sigma(u)^2}\,du\right), \]

and let $h = s^{-1}$.

Consider the problem of maximising (or minimising) $E[F(X_\tau, S^X_\tau)]$ over minimal embeddings
$\tau$ of $\mu$. Since $M_t = s(X_t)$ is a local martingale it follows that it can be represented as $M_t = W_{A_t}$,
for some (continuous) time-change $t \to A_t$. Define the measure $\nu$ by $\nu(G) = \mu(s^{-1}(G))$ for
Borel sets $G \subset s(I)$. Notice that $\sigma$ is a minimal embedding of $\nu$ in $W$ if and only if $\tau = A^{-1}(\sigma)$
is a minimal embedding of $\nu$ in $M$ and hence a minimal embedding of $\mu$ in $X$.

Define the function $\tilde{F}$ by $\tilde{F}(w, s) = F(h(w), h(s))$. Then,

\[ F(X_\tau, S^X_\tau) = F(h(W_{A_\tau}), h(S_{A_\tau})) = \tilde{F}(W_{A_\tau}, S_{A_\tau}). \tag{8.1} \]

Lemma 8.2. Suppose $F$ satisfies F-MON$\uparrow$. Then $\tilde{F}$ satisfies $\tilde{F}$-MON$\uparrow$ if $F_s \le 0$ and $h$
is concave or if $F_s \ge 0$ and $h$ is convex.

Similarly, suppose $F$ satisfies F-MON$\downarrow$. Then $\tilde{F}$ satisfies $\tilde{F}$-MON$\downarrow$ if $F_s \le 0$ and $h$
is convex or if $F_s \ge 0$ and $h$ is concave.

Proof. The result follows from the expression

\[ \frac{\tilde{F}_s(x, s)}{s - x} = h'(s)\,\frac{F_s(h(x), h(s))}{h(s) - h(x)}\,\frac{h(s) - h(x)}{s - x}. \tag{8.2} \]

□

Note that $h$ is convex (concave) when $s$ is concave (convex), and since $s''(x)/s'(x) =
-2b(x)/\sigma(x)^2$, the scale function is concave if $b(x) \ge 0$ for all $x$.

Proposition 8.3. Suppose $\nu = \mu \circ h$ is centred about zero and suppose $b \ge 0$. Suppose $F$
satisfies F-MON$\uparrow$ and is increasing in $s$. Then

\[ \sup_\tau E[F(X_\tau, S^X_\tau)] = E[\tilde{F}(W_{\tau^{AY}}, S_{\tau^{AY}})], \]
\[ \inf_\tau E[F(X_\tau, S^X_\tau)] = E[\tilde{F}(W_{\tau^{P}}, S_{\tau^{P}})]. \]

Remark 8.4. Whilst necessary to apply the results of the Brownian setting, the assumption that
$\nu = \mu \circ h$ is centred is not as innocuous as might first appear, and in the setting of a transient
diffusion it is natural to wish to consider embeddings for target laws which, after transformation
by the scale function, are not centred. For example, let $X$ be a three-dimensional Bessel process,
started at one. Then $s(x) = -1/x + 1$ and $h(m) = 1/(1 - m)$. Now let $\mu$ be any probability
measure on $\mathbb{R}_+$ with $\int_{\mathbb{R}_+} x^{-1}\,\mu(dx) \le 1$. Then, there exists a minimal embedding of $\mu$ in $X$, but
only if $\int_{\mathbb{R}_+} x^{-1}\,\mu(dx) = 1$ does this embedding correspond to a uniformly integrable embedding
of $M = 1 - X^{-1}$.

See Cox and Hobson [6] (and the references therein) for a further discussion of this issue,
and of the construction of embeddings in Brownian motion of non-centred target laws.